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ABSTRACT 

This paper diocuoMs nm probteM in the Machine translation of natural 
language, in particular, for translation frM Gersan Into Engl ish. An 
lop I Mentation of mm par to of ths translating proems hie btw built. Tha 
■ye too consists of a Goroan intsrprstlvs orMMr, to toko In Goroon tent and 
output a Mt of seoenUe r ear sten tst Ions, and a generator, to produce Engl i oh 
eehtehcee fnta sthgls ooMRtic represents* Ions. Although beeed on tho 
assunption that understanding Is necessary for correct trans let ion of tent, 
tho ays ten dOM not nou contain an underatandtng ceeponsnt to choose betuoon 
sonant ic roprooontitiono. Tho reprsssntattan of knowledge and ita uoo in 
natural language undorotondlng it a r oo aa rch am that la already undor 
intone I ve invwtlgatlon olootthoro. Tho iaploaentetlen doocribod hara io baaad 
on a eyetewic grapeer analysis of Cornan and English, and It applies and 
sx tends tho uorh o* Ulnogred. Special attention ft paid ta questions of 
sonant ic rspreeentatleh in a nutti-languegs Mtting and to etytlstic ioouoo in 
English generation. 



This report is a revised version of s thssls subnitted to the Depart nent of 
Electrical Engineering en January 23, ISM in partial fulfilment of the 
reguirenonto for the P og r e o t of Raster of Science and Elsctrlcal Engineer. 
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Chapter I ~v Introduction T 

l.i The Problea 
The following paoea describe # Modf I «f the translating P^ocasa* in 
particular, « suttss e^lg*ed,jto ■aoiR$.Gfipfii l JMHt«^4ffR«4uB«-w Engliah 
tranelatlon. the mod* I It not In »ny senas a soeplets ©na, sspsciel ly In ths 
crucial araa of language uj^etandlng, no>lap|ss^ta*loj|,#f eoa* psrts of 
the nodsl, however, hat been written, 1a v I e*^l ? the ^cowi doable history of 
tho mechanical translation probiaa, I should Mr set, *h«t ths objeotlvs of thia 
project uas not to construct a large-seals working systsa but rather to eea 
hou far soaa txittlng progress snd techniouse cojyld go in handling a group of 
prob I ess that cose up in text. 

Doss It s*sn sake, sense to sesak of eechanicil tree«|atlon as an 
independent problsa? In the early €0's it «a» «i^l#racogpliad |hat 
mechanical translate 

If the translating procaes^s strictly a ssttsr .,ef understanding in one 
language and, generating inenplbsr, dees |ha«,f*cl^lefl ..trajiilatlon i^oplea ae 
such asrit attsntlon? J think It is fair te aasuar "u§s" te r tbj slowest ipn. 
Translation sssas to rsquirt ths ful I power |J, language Interpretation but not 
ths full pouer af ge^oraAion. Thehards^ dneldinp. 

what to sag and orgsnizing ths sssssg*. ip gsosrslly not at Issue in 
translation. So transition offtrs a •oaswhat clrcuejcribpd context ui thin 
which to discuss I ssuas of understanding^ lanouegs rspressntstlon, and 
production, 

Ths translation problea also has th* attraction of a resdeb I s output, 
naaoly the translation. Tbe~©utput is • criterion by which success say be 
measured, although, of course thsrs are a number of pitfalls hersj first. It is 
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not hard to rscognize, but rather difficult to describe, what » good 
translation Is. Sscond, a Halted syatea lifca the one diacuessd here can be 
grooaod to accept particular sentences gracefully, so that its performance 
camhpt rlntHi. iMi;:^ai0Mi ^IfNar 'c^:ij|Mtf4ki}g(t'>lf--e«|i process successful ly or on 
the no doubt unlimited aeeunt of text^fer which 11 will sputter and die. Once 
we knot* the scop* of a particular syetee* h o uovs r. ue can use a given output, 
or a lack of one,- ■ as. f ****!*- for qHM$W*&** s*atuet Ions. Through the MT 
problem, ' then, it is possible : to contteer soee probtees auieuii to e nueber of 
areae of natural lenguaas processing. - - . 

1.2 The Evolution of the ProWse 
If the translating systse dlscusssd hsrs ts hot tn sny sense a "solution" 
to the Mechanical translation problsai M does fo o t s sent the evolution that 
has occurred stnee Warren Weaver's 19*9 ssatKendus on the subject, in Ite 2S 
year htstory r aechanieal trsnslat Ion has not realty been Involved uTth a 
e ingle probiett, put rather there have bean s series of brebteis, ae each 
systee that uis.buitt ppfntid!up ptNw>#9|y^MH. nsidld attention. Eerl loot 
attempts at I1T uera essent filly eeehanlced dictionaries, doing substitutions 
on a uord-by-word basis. Any eore lf»wlv»d proceestny ues fsf t to hoban pre- 
and post-editors, Uhen it beeees clear that wortf-by-uord substitutions did 
not produce acceptable translations, the prebisa uss refined to include 
syntactic recognition. Attention in the 8t*s eieretrly 60* s His csntsrsd on 
parsing with, for exaaple, Setttngsr*s breoittlve analysis, or top-doun, 
epproaeh and the work of YvngS'a group at IHT. Ths eors •ophlstlcstsd the 
approach to syntax becaee, however, the sari sharply ths nagging problee of 
syntactic aaPiguity case into focus, 'nechenfeel translation begsn to seen 
semantics as ueli as syhtsxv but "i t uss .he¥ exactly elsar what •eaanttcs . 
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meant. Three approaches rsprssented I Imited answere^. pT,obebj U ties .of 
lexical co-occurence could be uwi to sslect «* "llkel lest" parse, a group of 
semantic categories such •• abs$r»cj-cpncrete cou^ ^uaacl to paatrlct 
participants in a grammatical relation, and a, fj^aat ol^fja^da^could be 
used to choope possible interpretations, and h«^ pp§e)|lo paraff. Ho^fver, 
as Bar^lllel and others re«arked f even.jf T aueh ■j^^^could ola|e 9« , 
reliability, they would have tjif : diawj|t^pr^fr|^|hjt^U "ould.be, 

impossible to predict where, tha i arrora ^*JfW4<iJkWt^& l ?'>WQP&* t t0 
••■antics um necessary. ., ti ^ K . s ... 

In recognition of $^41^1^11^^^/^ ee^hanjpij tranalflion 
problem, support .for short-term, l.%, p^^jcaj , 4 prp ^|c|s dr,jed. wp ?<; Jhj ; ear ly 
68's saw a shift away from attempts to build working systems to an emphaele on 
«ore basic rssearch. Limited deduction cmm Into uee in eyeteme euch ae that 

of Raphael i?9) with t^moU^^M**^ MWIW m fWHJW*^ * on,u * 
static "me«w ng- bjfjt *lao a dsdMC.tive^Ji^j^^ *"• 

development of PLANNER and Uinogr,adVp ^^\gn^M^^m^^ .. *** "J »■»*•*■ 

waa deleted, and more m*-lUfi^l?*Mty&#!**SH^^ # r b n llWJ 
I anguage proc§sj|ing, lap M*M s ln MinogRad,* a system. £ut ne^ cj ? ajr I& ev J dent 
because of i ts awtion^ensMar ingnaVrf. dWk #*Jfc t^slPWM' * M*M 
sentence is not rm§) llM&? l n d J a Cf9 l ^^4nWlV^^ t ^^^.W^i* 
larger context. Eugene Q^nie*^^ of 

deal ing jiUh «>nta*t impief^t •entsnc^ 1 b^ 7 Sf^t»nc^ flT *^ ffH*> *°, th « 
way of word-by-word tr** If ting, s^tems %?r ||^ :%&&«** a^^thit the^ sentence 
is not an important .mfc unit, PMt it.#?ft MHtt9!| WW *M ***?•" that 
has a chance of success, at Wgh, | -!•*•*!* ■ | *S|l l, W«|f^l^ln>..il*..#*^ t can <toa | 
with the interrelationships within text. 

Uhilo we have not get come full circle, mechanical tranalation ia 
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surfacing again as the naas of ths prebloS, bri aors properly, as tha naae of 
ons problsa, tines language research hai dlWgiMi considsrablu since the 
inception of rasaarch on ITT. Avowed translation work Is underway at the 
University of Tokos at Austin, the University of California at Berkeley 13), 
at flontresl, and at Stanford tUMk», 37J. At least one co^Ony, Logos 
Development, Is MHtetlhg an HT sustee for English to Russian, among Other 
languages. The system ih*i ie« : dn poe%*1tors i&th S knoulsdge of the source 
language, but the company claims a hi^---h^ ^-'ikS^^^f^* : p^ttwiHXina' 
I could not get any detailed information about u 10111110 processing dona In the 
Logos III systes>, btft free sxsainirtg lbs company's tttstlriure, f get the 
i mpreasi on that sons sort' of «eoant ic t yps^ch o ofc lne; is used. • 



1.3 A.5iB^'^e*f :; ; 
Assuming that the t««t of a translating systsa is not Its ability to 
handle isolated sentences, b\rt rafhsr its ability to ileal with connected text, 
I selected a paragraph from a 'paper by toimpelmenn on octopuses tt7). The text 
was used as a .gdtT'inir thesysteVleno^lie^ Wlil^Wno^ev 1« be general • 
way as possible, the types of dtf fifties tJhbt ; *he par a graph presents. 
Choosing texts seemed much sort dsslrebla '-fi^^wl^ltel^erf,' since consciously 
manufactured sxamplss often hail an 'unn*tliiW sound, ATI s if he adit innocent- 
looking sample of "found" text wilt ususHv eon tafn a number of subtle 
difficulties. The ful f paragraph Is pretsnted snd discussed be rou, but let ae 
first display the accomplishment* of fli Irans^itthg eyeteisv 

Eln deutilch alchtbars* Zsichsn M' die la Nervensyttea vsrlsufsnden 
Erregungen let das Spiel dor Chroeatophocen dor Csphalopodsn. 
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A clear Ig v i » i Jfc» I • litfl cation p^m^\m\m^^ £#& **?<*& the 

nervous su^te*, is the, pjjy. o£ .^f^0^m^^^»KBif mlo f Md ' 
The skeptical u.i.ll sjiy t^V^-M* }%J^M>W¥ ^9^}^^^* ^^ '** "* 
reeark that a eajor factor ;J.iaJ|| ; nj| ^<^f^^ #%^te! i »} t ^J'« r « 
constrained acope of Jthe ^Ja* .fM^^iry. J>to&WmJM rtmi P"*'««* 
atill doreeain. It ^Mpujd tak^qnJjj A&'fliUlPto WWilWII^ *f pout,n * Mork 
to increase the ^p^t»ut t o,f : -M >!!**&, $**!*{&&*' ,Jr4 H ^^«!i ,tion "*"* 
again that thiji trans.»at|on wat J^|f!iJf%f ^t^\ A ^p«pt ^do 

understanding, so thaJTjIlj^J^^^ ^■« c " of 
coreference Ma# ; dpne,|sy J^us^^^^ not done for 

the eake of exhlblt^^ai^UktJ^ it Mas 

In the \nt9rn9t^^^mm^^MmiWS^S^^%^itv 

Let us jo^ at 4 j*f jpmh mkm*#m+^if l9 wi&fim>p f tha Pr**'*** 

that a Mchenlcaytctnfi^tl^ *"-*«* 

text, and fol lo^n^t,^*^^4^^ 
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(1 ) E I n deut J IcK $tipfa$mM$ff & ** ^IN'WWPlIf" 
v.rlauf.ndw Erreounoen It^d^lpJaJ^def^Pir^^pili^en 

dsr Cephalopoda/), 4jajiajr : .4«^fr e ^fff , JN|«l^eM«WP|pR4 9*Jfc* ' s 
braun, .chute?, *Jftl«# ^er a ^raj||^; f*|Jttfr*b&M* n « 

(5) dj e,, j|. ch,^t^e|pr 8 ^■Wwaju jjhyb q4r*fc&^<*» -«•- 

ans.tz.no. nuake^ ' l%*pf)|& -«WlWl ^ "JWl^ =■ - 
. konnen,^ I1,H S Jhrex ^J ff^ve^aj^^^v^ii^ara^^p 

zu tinea £tw n iajpa^ : ; 

anzupassen. Ola «^9^ ? @l|^if|^aVha|qpaa^p|^ J^^WMMWi^" 
(10) Raize ^erden n^cht nur dur^ls Augeo, ^gqgfj&.^ftl,, 

durch dieee FarbzeJ I en %ij$&^*^^ n jg$%$t?- 



1* 

Krafcbn b»J ptfitzlichbi* Zunohiio dor Uchtlntonsitftt 

* ' ' gtMT-dtaitelV *yen tMt^ v i^#'(pi^Milr sl'nlaV" ' tfflift^tftft'to 
' W^t-'ifci^lWIrtttScI^"**' B» I lung otter Avwdrtwiunfl dor ; 

(IS)' GhhoiartS^to?^^ Uonn 

d1ooo*nreitt flMrtfon, so stmr Hi tfVigoooJnon rfto 
Clw«ewifti^^ in Ruhoi u*m ei« abor sougon, so ' 
apt«i«i Jw». Sofblt dl»0bb*f ttohO>«>osth O?fonh«lt 
da« Unt fyu »d > 8 QPt, j» iHUidob , ob »U Qftt odor 

(2$f rauh iit, oirar vofsehf odono tf h**K*HF 'Wf did 

: (*»rw«i*ij|jiio*^ ~:f t , nrf#IW«w^-diito''NiiJtt ; ^« ! ' 

Sau^i»pf« »ntf«rnt uord«n *thd, btfctbt «t*»d»g 
golbgrou, f trbt si ch auf Roixuhtf obor noeh dunks I ; '' 
(StolnacW. ttrr1ao»5'dtf^t^i^^ "' 

(2S) ChroMto^^afei^ial pf l««w B»M9uno*n (Mr A<^« 
einborzugohon, uw sieh bsiortd o r t tcHort m Sdpls 
booboehton llsot. Auch dor Tr ichtor pf logt dabol Uossor 
au»*tJ»pHt2on. Ob dio C*ph«l6ptKMn solbfct but- Vorbdn 
reaaiBTBn, ist nicht b«*«>nt. NWJh voo «••• »ol1en bio 

(38) otcte ul#*dor f artjinbl fr*ilo ! 11o«sei*v1^^ ' ftp dbor ' 

»»nch« in der Tiefw* tobofiiin Tintbrif isfcho In 
- v«rschJ»d«o«n bunt«n F«rb«n •rftr*i1»nd» L«uchtoro«n« 
b«si t^«i von donbw tan anhlbsbi, doss Hi: zJiT 
gegonsoltigen Sichaufftnden dor Goochtochtor dionen, so 

(35) soheint das zub bihdooton ffir ' dtosot^olyKi ' W'oihort ; 
Firttonolnn zu spr*ohon» Elngohondo 
Untsriuebungon Obor oinon ottttlgon Forbonolnh dor 
Cepha4opod«n sind sohr orufineeht. NoWdliigs hat 
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Frfihlich Unterechiede in den vm Auge abgeleiteten 
(40) Aktionsstrdaen auf^y|rtf^tf^ A Far^J^Jf|^j^t|llt, 
was ebenfalle eehr dafOr tprlc^t, df«f dlftf J)«r« ,, 
Farben zu unt arsenal dan yeraogen. , 



A clearly visible indication of tha axe i tat ions that run through the 
narvous eystea is tha play of tha chroaatopheres of tha caphalopod, thoaa 
calls that lia undar tha skin and are coteroA^loug, brjp|* rt black,. 
purple, or ear a? no m. r - TW)H&l*KiiH tiWaWNtf aWlflaln, Via 
radial ly faeiened aueciee, be apread out under, thaek in eurfacj|. Ulth 
their help, tha ehlaerTe able t¥lo1i|^trao^^»SrW ^tha coibr l< of 
its background. Tha stiauli that trigger, the play of thachroajatophoraa 
ara parcJtviThei onir^^ 

theaeelvee. So it is that caphalooflda beeoee avjte dark in response to a 
sudden I ncr. see in Ifgl^fh^lWt^^ahWWii MT^'atlAe*. On 
tha othar hand, tha etate of. contraction ex- relaxation oft be 
ehroaatdphbriaMt alw ^p^y^t ontb. tuckar,; £lh«n trMe*are -not 

rwnvad ratlM a iwllwith or«u, ijthoy* jt ii111jM.U1,» *WS • * o 1 v«o 



in 



posseis light org.n. that shin, in vir tout brFflht'cef or., ^d since on. 

assuaas thasa aarva tha 

than for at taalPlhaai 

color. Thorough instigations ^ a *^iHff«?f% L»*«Pl •J u 8Ro% 

cephalopoda would ba very/ ^ItelfablS: ! ^aci«ty?TroliTch 7 u1ia dote td 

dietjrxjul eh differences in the neural Jpujee§ pant ; £ 9«| Jm t $ha^,H^n 

given different colored «tiauli, mm IgW fWWronlly iufelta that 

these aniaals are able to distinguish colore. t . 

The choices that had to be aade in producing'lhe transition wiVl be diecueeed 
in acre detail In the chapters tnai'foll6^ ( ; aui : a^G^aQ^'loo1i :> it the text 
reveals the fotlowlny sorts of probleast " 

(a) SYNTACTIC AIIBIGUITYs Choice of a part, can navi rather etrtklng 
influence on a translation. For exatple, line ^ani 8 '29 can be pareed in two 
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wayst 

Qb (dis Cspbslopoden) ssibst suf ' rarbsn roagioron... 
Mhsthor fths copholopode) svsn react to color ... 

Ob (die Csphsionoosfl salbst) suf Fsrbsn rssgisron. .. 
Whether Hhe cspaphaiopods theses tvee) react to 
color r ... . -~ ' . ' 

(b) PRONOUN «£FEBaK£S are quit* cmkh in the parsgrspn. In line 7, to 
Hhat deea jftrpx refer? The suaeJea? The, chr o o at os n o rs s? It Is our 
underetaeding of the preceding sentsncs, rathsr then sons hard and faat rule, 
that datorainfi a cbeJcs bar*. th» sh s o aao n s n of pronoun refenenco is not 
only linitsd to personal pronoun*, but in l«tiy« pronouns such as 

dftOttO in Una 33. ond dsnonatratiyt eo^octivejuoe pronouns such aa dioj* 
9nd jjtoa. In Una* if and 19. In addition* coaoound* Uk* gf&ftL and djjflr. —tt 
be uMd to rsfor backward (tins 27} or to for forward Ulna 41). 

Going fron Gorton tp English, ua can Often oat u ulth tronalating an 
ambiguous pronoun rsferrwe <*ltb «n squaljy esbiouou* En«U*h pronoun. When a 
sentsncs needs considerable rsarreiiging/hoievor, tola will not tiunye uork. 
For OMonpla, ua nay N*t to put in * noun or sup ubers |bs original had a 
pronoun. Nota fttao |hit tha Halted it* of thl a approach la partly « by- 
product of tha lonouaga pfirs photon. Going I -ooJlsb into Goraan, uhoro 
gender agreenent is rwsstsry bstwssn pronoun arid r of si-rent, a definite choico 
of pronoun referreof Mould be forced •opt |^|on,thao when tho direction of 
translation is free Gorean into English. So uhllf odacUion about pronoun 
reference ulil not be forcod In ever y. cots, .. a translating systaa mitt pf 
•quipped to 01*0 thf dpcifion uhsn nscsisary* 

Cc) As if pronouns arf "°* ««oupn» Uf si so hsvs ( nrlis of NOUN 
REFERENCES. Lino 7 usss »die Tioro* to rofor book to tho copholopoda of lino 
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3, and i t require* an ui^retand|pg of the paragraph to determine that the two 
noun groups are coreferentlal. .■ Ilr..^. : fPS-9 r '|W»«^f- ; ^? f ff^!!!J*^ , 6r H '. can 
translatable Tiere" limply ff "tha ap.^aKJ*^ ,||j!*N animal*" elnca thla 
sounds acre naturaJJn English), If, - : p^iba^ftb^„tie**d^ /pMe ; /rjer§" refere to 
aniaale in general, Jban.tha p^Ml^^^illiyV^ilM.J^;"^ 1 ^!^" ' 

(d) CHANGE 8 QF TQPJfQi I«kfng the, senlanc^iJeg t^oo. uj*b ; Talna E tada?*" on 
line 21, and reeding, the mjJn clejMp,o| tj^nfxl •enteoce. 0i l| I* opt specified 

whose chroaatophcref and arae wa are l^fc*JPff|^V"3^ h ?b^lf#W" i .^^^ pt * of 
any cepha I opod. Thera are , c J uae^pul h* hava. s 4p d ^|a|#>fn|, the context to 
find them, Prot»ap|yj tha,e|fifM,Ka^$o : d^l%Jj» Ja IH*!^ i eJ^WS , ?? n '" 
tha subordioata clause* j^ caiv.refgg^thj^ 

but npt of eledonee, sp thgdifeueajpn must per ta4n,|o^§a|^op^ l^jjffiaral. 
As ijj, (b) and Jc^aboya, a trenaJaUpf^an ueuaU^ge^by wUhput keeping 
track of chapjaa^of iopup.^But ajtuJiJome #MJ SfsJa^etM^^^^PWMo 
make eoaething explicit that ues left implicit in the original, end, for thle, 
knowing the topic could be crucial. 

(e) Another very difficult are* it MORO CHOICE. The paragraph here ie a 
■ 1 Kture of - tecbru ce I *ad,, cpafonj language. , ,i«p.|pnef «J,, - t f ^f*ra-. j pajgo n the 

.word, *he more variadjts, pas, ; ,for, : fusta*m4ea *ft|ff^«l*j**# SfT^fff* 1 fro " 
line 8 the dictionary (2# ea^|ia f a>*al ^ho#ee^be<^^« , 

1. color, tint, hue 

2. stain, pajnl^dua,^ r ,_., ; 

.3, completion .... - :: ,, 0i; ; . v .«.' ,-;-}-«;- - 

Therefore, even if we know enough to i«Jta«gje, s *aj i |, pjvg»oun and noun ' 
reference, we still hays i,|kt\ knp* .|pre», ^t^g^^l^^M^X^e^pt^ tmt^m 
different words, differences of co^ issues of 

what words combine beat !<] th ^lt 5 P$h^^r^. f . : r 
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Note that technical language It oftsn quite • bit ssbIst to work nith in 
terns of word choice, since the "'iisanlngV tend to be carefully circumscribed 
and usages ere ftotttM. fhia^alitff *eed one ^lnlnk*#4iihlaHly technical 
text would be a 'sie>fx*^^«»)iBl»MlHMtl ^irmna^atbr* " ; Cb e pari d to poetry, of 
course, ' i t would be. 'IsM'ttfeltaeaV'^^ 'le^hnleaf' # Itlng cannot 

avoid prepositions, Mil*e%|iwBewH1^ : ;eyhtibfcTc 

structures) are probsbty the twst ssblQttsws woTos In any tsngMSOs. Herd ' 
■choica, 'thar^er^ ie-a~#^ :'-"' :: - ■-"■ 

Itaat these #t*e pr^N^eae - tit ^tilvm&tMV - J bivf'' in coaaon 1 s that they 
are unified problems \n news -bntyt%88«h ^peftliilar'^l^iilfon presents Its own 
di ff tcul Hes and f»eo^frss i^a'-^ottMn -'U*^e|iB^'W^lpl4iil. *1ifsV In the end; is what 
eakee langusoe processing so difflcwit. The bsst lEhat'I 'tall I scale ressarch 
pro fact -ca^dr-le^t^ that : '' 

8^b11«r^te^w#o^NM^ea4Nie aee#^e1ii1 ; *1tt»4h^^ thai 

occur. ■■'' : "'" c ''""' 

1.4 Limitations 
Before beginning In ''earnest, '•i%^«^t^li^g»ol -laaa' to skstch out the 
boundaries Of the project. Sosir'^fcas^eettl^iSive 1 bean Incorporated into the 
project given wore tiae, *^iie seae arrlN^ their bun right. So 

here is a list of what the sys tee is nets 

(a) The project was Ha i tad to written text end net speech. Thue, problems 
like intonation, fixing word boundaries, and vocal -aWfarihCae between 
speakers did not have to be touched. 

<b) Aieo, Input was assuesd to be graaaaticsily wsl l-foraeds no atteapt was 
made to - extract vmmimtwg*' irm'+ fora that was not ful ly-dsf Ined for the 
system, flesders interests* in ths ..^©bise* of hantfling urigraaaat leal text are 



,.t -> r *intr>-~' 



«#*^?^ffi^'^-S^ 
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referred to Levine (21), who has developed a prograa for grading Gerean 
sentences in a classrooa situation. 

(c) No understanding of the text is etteapted. Thif if because the prpb lews 
of representing Knowledge and using this knowledge for disambiguation are 
extreeely complex. A considerable aaount of research is underway In this 
area, and whatever results appear in the field of natural language 

under standing' will have direct relevance for translation. The Gerean 
interpretive graeaar In the systsa is connected to the English generator via 
an understanding by-pass routine that requiree huaan intercession. This is 
intended for deeonstrat ion purposes only. 

(d) The systea is a very poor conversationalist. The Geraan graaaar, which 
is the most coaplete coaponent in the systea, is prepared to paree quest lone 
and eoaaands, but the rfst of the syf tea is gaared strictly for declarative 
text. This liaitation Is the result of tiee constraints, but also ref lecte 
the focus of the project on connected written text. 

The reader by now has an idea of what the sup tea does not do, and the 
following pages will hopefully aake it clear not only "hat the eyetea does, 



but how it goep about doing it. I havs triad tp.yiye Engflah equivalenta for 
Geraan exaaples, so that a knowledge of Geraan should not be neceeeary. 
Faeiliarity with Uinogr ad's systea (39) and LISP, however, would be ueeful. 



1.5 Organization of the Systea 
The iepleeentation was written in WFrLlSP, and runs on the POP-18 
Incoapatibla Tlae-Sharing Systsa at n.I.T's Prolect, HAC. The tranalator has 
six a^jgrcoaponents, whose functions ^a/^ i^tlin|^ below. It currently 
occupies about 188K of cors, which includes the LJSP intpfpreter. The systea 

, -,-'.■■..., :s f™ ?,.. .■•':.- :■ J ; "i '.' ■/>■■■ -«■■' -»■ ■ .* " *7* ! ^TW'5 "■ * ' 1 -■■■■■ -;- i ' ' a- - * ■ ■ "-' --■ '■■-'.•■ ■■.'■, •' 

has only been run interpret! vol y, but could be <%^\\96.Th»M}& component* 
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aret 



The (&IW BKffHNMft, which contain* syntactic Mil seeentlc inforaation for 
the appro* ieatety 2S9 geraan words currently defined in the' systsa. 

The Morphology routine coiled ItirW i# fhtv first peoo* In processing a text. 
Given a sentence, the ingot routine analyse* sach nerd into I to root and 
endings, then uses the diet ionerg to construct pesoibie syntactic feature 
lists for the coabi nation, this syntactic inforoatfon* afong uith seoantics 
picked up froa the dictionary definition, is then ■a se c tatod uith the sentence 
word. 

The GSWAMQWftl^i* written in PfVKfttftWI, s L19P eahedded language designed 
as a granear -writing tool (tttnogred 3&1. the greener routines use the result 
of INPUT to construct s singls perse tree of the sentsnes. To do this, the 
graoear interact* with the saaantlc c oap on e n t in en atteept to liait syntactic 
aaoiguitio* by the? Itttited aea ari tlc case checking currently lap tenanted. 
Uhere a syntactic! ambiguity cannot ■ be otioinated )iMMod)oteiy,e choice is node 
and backup is used if necessary. 

When the graaaar ha* parsed a section of the ssntsnos, it calls the SEMANTIC 
COMPONENT for initial secant ic procssslng. Seaant ice constructs as aany 
representations a* possible. If this neaber i s : wro» ' syntax "Is asked to 
r spar so the section. If the nuaber is rton-nuil, fell posslbls interpretations 
are carried forward. Theee seaantic aabigulties u) II be e 11 »1 haled later in 
the eentence either by the saaantlc co a pone nt or by ueef intervention, so that 
only a si i^l e rspressntai ion is sent to the generator. 
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Associated with the semantic structure ere th« routines of the ENGLISH 
DICTIONARY which, when executed, supply ■ set of relevant English words. 
Using these words along with other information aval labia f roe the semantic 
representation, the ENGLISH GENERATOR can then construct an English equivalent 
for the input sentence. Output is not necessarily sentence for sentence, 
ein,ce the translator does have a Halted facility for breaking down long 
German sentences into two or eors short English ones. 



The sections that follow deal with the system in approximately thie 
order, with the exception that information about the dictionaries is 
distributed throughout as it becomes relevant. Chapters 2, 4, and 6 discuss 
the sections of the model that have been implemented, and chapters 3 and 5 
discuss some issues in representing knowledge and understanding natural 
language. 
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Chapter 2 — The Gwun Interpretive GreaMar 



2.1 The UndfTluinf Thwr^^ ; 
Tha analysis of Geraen uas baaad on tha thaory of ayatafic graaaar 



davaloped by n.A.K. Ha 1 11 dag (18-16). Soaa of HaLHday'e ideaeon dlecouree 
hava alao baan used, but discussion of this it deferred to faction 4.9. In 
deecriblng eyeteaic grseeer, I auat necetearily be bHef* ,encf tha reader Mho 
uanta an in-depth traataant ia referred, to Hal 1 1 day, Hudeon (20), and Uinograd 
(39). 

Tha central pracapt of Halliday'a thaory ia that language ia etructurad 
to convey Meaning. That it, any analysis of language cannot, and should not, 
divorce fora froa thia tlnglt, overriding faction. Tlje job of conveying 
Meaning i a delegated aaong different •yntactic ypiU. of "^eh there trm three 
renka i cJLlufjft, flLflua, and uflcd. At the group level, there are noun groupe, 
prepoeition groupa, and adjective groupi (I follow Hotfscfl here in exiling tha 
verb group j aae eection 2.6.1). The aachantaa of ri^Mfft aareite one unit 
to aaauae the role of another, for exaaple, a cleuee%eTy"|taunthe place of a 
noun group. Associated with each unit is a network oT li§Ju£ii, with a aet of 
Mutually oKdusive features known as a amiii. Theee networks specify the 
choices evai labia in the language. we aovo froa one level to another in the 
network bg satisfying the mXcitaibfilte ; ! of r a eftetlft faclj chofci aide aay 
aet up certain conefriinte on the tWfeea etructoVeW'ah utterlnee by ieane 
of realisation rules associated with It. for exiapfiU s *he decl tlwn to fJtot an 
adverbial (a propositi oh group or' an ad jadttvagrooff in the f frit poa iff on of 
the eentence in Geraan Means that subject and finiti <t.«. conjugated) verb 
will hav* Inverted order; An rapbrtant point about Hliffday , « theory I e that 
the choices at a given stage are ndt ord*r«du1th reepwittb each other. 
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Unlike traneferaitlohai grgmtr, w« art not deriving one ful ly- specified 
structure froa another. Instead, at m proeaad threugh the networks we 
accumulate partial inforaatidn, until In the end the surface structure of an 
utterance la ful If epeef fifed. 

A network for fne (Jeraan clauses nandladi« the ayatea appaare in figure 
2.2, along with axenpl* wnt«nc«». I havt'lrfad ia'1dap fc lha^aa«niati'on 
close to thai of tUnograd, to that a reader " f iaif laVltf fh hliln% li ah' eharte 
should have no difficulty hare. Tha notation uesd in tha network Is shown In 
Figure 2.1. In Figure 2.1a, the vertical line indicataa a ayatea, and tha 



LABEL 

a) ^ mmmmm i imtiim ^ 



fi 



b) 




c) 




Figure 2.1 



horizontal Una on thj left spacii its an 3 anUy ^cgndl |lon. Tha tuaU* way be 
labeled, as it !• hers, bg writing the, label above tha ,fn try condition. ,If 
two independent eye test share the mm en trg cendit^og. this is indicated aa 
in Figure ,2.1b. If there It eort then ooe |njlr,y condition aeeec I a ted with a 
g/iven eyetea, it eey only b« neefffary, to G ^ 

Features indicted by a dathee (— I are unearned, and art oaf load aerely aa 
theabfwn^a„o|jthaothjr feature* in tha p^tap» !fl . ,.,.,.,. 
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The following example sentences correspond to the nuebered features on the 
previous page. The translations atteept to^isfe.** rough ,dM °* *•* 
structures involved.; 

(1) Ein Seei-kolon genogt. 
A see i -colon suffices. 

(2) Oft, genogt ein Seei-kolon. 
Often, suffices a seei-colon. 

(3) Sis warden eingentlich nur auffsJIen, wjqq aie 
amfalim. 

They would actual ly only be ne^lcejy whja tjtaaj f£ft 
Bltsiflg,. 

(4) U& den Hypit ZU ciilsn. storzte eJcb dsr Bauer Ins 
Uaeeer. 

Is. Sflyjl Hut dfifl, the fareer plunged Into the water. 

(5) AJLlt LiBitec. Aiogaad, kaaen die Kinder hlneJn. 
The chi Idren caee in ai nflffenojflffi ffijnit. 

(6) At p f I ft^K •nggtemtfim aehzte ein yejMesssnea Boot in 
Uellenschltge. 

Baumt Ifi a fifili* an abandoned boat groaned in ttw waves. 

(7) Ebon bekoaeen wir die Nechr icht. ejfajp dor Zug - i|eeft. 
nicht ab gafahron sol. . ;^> u L_,:„^ 
Ue have just recieved news Itot thy train bit IJCLL Isft 
yej.. ' : '.' : v 

(8) Eben bekoeaen wir die Naehricht, deif Zug — i pqch nicht 
abgifflrw. 

Ue have just recieved news Ihe. Irjjg hjg, noj, I a, ft ygl« 

(9) Ich hat to d*e Getegenheit, B^tiiQ XU btejtfbtQ. 
I had the opportunity & visit Bjdia. 

(IB) Elne Eiedone, pJajpa^aJajliJjSiui -SsttMisl^al'^aV^lBSaii. . 
s i nd . ... . ' ■ 

An etedone* ^\\ f^'idhaajl suckers 

(li) mt dew guxfih Lfchfrilir 

Chroaatophoreneptel ... L.™^v : | 

Along with the by li ght atieuiq>j«l icl^ad Bleu of the 
chroaatophores ... 

(12) Ein deut itch sichtbares Zeichen f|r dis ie NorMansuatea 
yer I ayf endejj Errogungon ... 
A clearly visible indication of the through the narvpuo 

avals*, cynainfl eKeitetione ... 

(13) Irh imKm an. riapj «)^ achpn fart 111111- 



I assume igjl Jhjy jet gjrjjgy flBDt* 

(14) Ich nehas an, ale Bind eahen fort. 
I assuae thau era already gpM* 

(15) Ich bi tte dich, ajch, Bjkojq zu baauchan . 
I request uou to yjaJt 



?.«^j»awii pw^ * i ^wH**<ff» r %- 
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2.2 A Definition for Syntax 

Di Bt ingui thing eyntax froa eeaentics is a si Ippef y busineee, •special ly 
when I have just ciaiaed to eubscribs to the idea that tha structure of an 
utterance is intiaatfly entwined with the aeening It convey*. Still, there 
seeas to be soae life left in this very old diet inct ion, so 1st ae Bake an 
attempt to deliait a useful boundary. Traditionally, eyntax nee dealt with 
the fora of language and seaentics with its content, or aeaning. This 
definition is a start, but it is too auch of a slapllf icatlon, since it 
ignores the effects that seaantic content can have on fora. In Geraan, for 
exaaple, the default adverbial ordering is tiae before Banner before place. 
Here, tiae. Banner and place look like eeaantlc catageriee. but their preeence 
hae a direct effect on the surface structure of the utterance. 

The whole situation Decease even acre coaplfx whan one conelders that 
aoet cholcee about the fora of an utterance have what are, in terae of the 
traditional definition, eeaantlc lap! i cat! one. It is true that there are 
cholcee that eeea to be aotlvated by. syntactic rules alone, auch as the fact 
that the preposition gbDJL takes the accusative case In Geraan, while one Ilka 
b«i takes the dative. (Hera, and in whayoHoue, J aa using "rule" }* w }** 
broad sense, to aean a regularity in language.) Nora. c|»Baw, however, lea 
e i tua t i on such as p I ura I fpraat i on, I n Goraeiv there are eeyera I waue to fora 
the plural for nouns, with ^^ different ^^ noumi takinj -arij, -e^, -"e, -"or, er, 
etc. The choice between endings is a syntactic or aerDholpflical one (although 
aany words ars fairly Idiosyncratic^ ** at ^^^ H !?%^ x }^ of * 
plural ending reflects a distinction bstuwn one ana; apre than one that ie 
bas i ca 1 1 y seaant I c, i n the tradi 1 1 one I sense. ,.,.,. n 

To flra up our definition, one way to go froa hare would be to identify 
the syntactic asoact of languaje with rujss that gavarr (..f^tanea foraatlon and 
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say that semantic* is the great eorasa beyond. This definition begs the 
question, however, since the content of language eay be just as rule-governed 
as its fore. For exaaple, there bight m'trufo te : "express 'tha fact that "Tha 
blue horse Is blue" is redundant, or the fact that ''The blue horse la green" 
is contradictory, to call such p henoaono syntactic because they are rule- 
governed is f inei the only trouble Is* however, that than It is not clear 
what, if anything* is left for semantics. 

In bui Iding the system, ay decision whether a given aspect of language 
uae syntactic or semantic mss based on the following definition for syntax. 
Syntactic rules givet 

(i) a minimal specification of word order 

(ii) a specification of the morphemic and lexical tags that explicitly 

aark the relationships between words and W'rafatJana within and between 

syntactic structures 

(iii) a specification of punctuation 
All the rest goes into semantics. '"II "am ignoring the semantic/pragmatic 
distinction, since I find it even aero difficult to draw than the 
syntactic/semantic one.) Note thai ttogHen"' syntactic rules Vol y very heavily 
on word order, while Gerean strtkss aor* of a balance between word order and 
Morphology (i.e. ease distinctions), llfth'aore' riliahce oh explicit tags for 
its syntactic distinctions, 'tkrmah has a concoaitantTy higher degree of word 
order flexibility than dees English. With reepecf lb Ii), the definition of 
ainiaal can really only bo given in the fere 'of a I fat of syntactic rulae. 
The closest thing I have to this now I* ths set of "morel order constraints 
built into tha interpretive and generative yesaars, but of course these ere 
not .Complete. To give you an idea of whet T hove In aind for (i), I would 
consider something like verb ordering in r a syntactic rule. On tha 



. -,,.1.4 • s-,-.-3 .' - - • -L)!5SJ»|»i3?f > 
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other band, such thing* as adverbiaj ordering <»«ntloned abovs) and adjective 
ordering before a noun ("the big red block" ae opposed to "the red big 
block") are order inge beyond the eJnjBet. ; and are ¥ |aee^ 5 (^ ejpantjc criteria. 
Relating thie definition pf aunta* tp the na|Hpr^i, e of susteeic grapeer. I 
Mould call a feature euntactic if its associated realization rulep all fall 
into the three eategor jam oi yep above. ^ 

I realize that the definition .of syn;te* gJven here Ja a sketcnu one, but 
it ehould be enough to givp a sense pf the criteria thojfc were ueed for 
dec! din9 ( uh ich regular J tiaa, ? ihpu Id be ra^factjatf. ,in ^ha^j^n tactic cpfponenta 
(both interpretive and jeneratjve) and^wbich ip l^jJf^Hpnt |^.^aaa»wipnt . 1 
ehould add here that theae prlteri* Mere ignored occaejonallu, for the pake of 
efficiency, ao that the interpret iva gj-aia^ ? |rc^aB ie far froa a pure 
linguietlc etptepent. ; 

2.3 Word CJaeeee 

Uhile I will nptdiacuet the treataanj of claujaa and grotJpa In dptai I, I 
would like to look at clessificptipn at the uordjlevjil. ? A deecr lotion of the 
different ports of speech ueed for Geraan words fpMpMe, while a •uaeery of 
the features ueed in dej initlona can be found in Aaaae^*!* A. 
2.3.1 Adjectives i and Adverba 

A distinction Between adject 1 ivea and ,.#*■££■, i^e pe^Jajly difficult to 
draw in Geraan. If^erp in Engllah^ «e ha^ "Fido |i^4a^%and "Fidp la 
good ." the equivalent in Geraan it "FidQ slngt au|" apeyFldp lej flyj.." To 
got an idea of where the distinction bp|ween a4Je^||vo| .and adverb* I a to be 
drawn - or whether aucb a distinction even ehould b^dr aj^i - wp need tp know 
the functions that these units Mi 1 1 have^ Flyp IJHplu^Jopking functionp would 
be'j 
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(1) attribute of a noun (declined), at In "cter bjjyj. Engei" - "the blue 
angel" 

(2) coup I Men t, as in "01 e E lactone tat tjiyg" - *the eledone la aaart" 

(3) postnoainal aodifier, as in "die Straeee llnki" - "the etreet on the 

■ !«tt 

(4) Modifier of a verb, as in "Er flnrt UtiUJLL* • "He drivea faet" 

(5) Modifier of the Modifier typee (lMhrough (6), ae In "eine iajjr_ 
guta Gelegenheit" • "a very good ©p^tuntly* 

Especially when deeling with Geraan, it be better to have a elngle 
adject IVe-adverb category callad.*Mo^Vfl|ra*;;an^'%ii^iB the probiea of uhere to 
drau the line. A highly conaervativa line, however, wee drawn for the eyateM. 
Basically, an adverb is a eeaber of -^.;.a1^''"^r'pWVarB)a''''fialct ; l.bn (B) 
above, but there ere aaverel conditions thai further conetrain adverb 
Membership. 

One criterion for an adverb that Mill be used is that it aay never take 
adjective endings, that is, perfora function (1). Further, m adverb May 
never have a coaparative or superlative f ore, fifcfh criteria smet be present 
at once, since toes perfectly good uords that take attributive endings do not 
have comparative or superlative fores, and there #e words i that can perfora 
both functions (1) and (5). An exaeple of Iw' latter i a dautl lch in "eln 
deutlich sichtbarss Zeichen" ("a cltarly vlslbls Indication") and "eln 
deutlicher Satr" f*e Meaningful sent**ee*T. Candidatas for adverb, then, are 
words like ao., Mbc., m, and a feu others. 

It Must be obvious at this point that adverbs era a rather select group, 
and that the c I aee "adjective" covers a lot of ground. To all aviate thla 
probiea, the eyatea uses features that correspond to the functions given 
above. Adjectivee are either ATTR (attributive, function 1), COMPL 
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(coapleaent, function 2), P0STN0I1 (postnoainal, funeticpJ),.or RElflOp v 
(relati>oai^iflars, fusions. 4 s^>) v^ajpajpiJ^ «n 

object at a verp dees, for,e*aa|>M|#^^ \.9t "Ij^Jtr Jf^fffL?" ( "^ hai '* not 
worth the effort"). .If ■ .thlf ,1,1 \ : p^iJ^^^^J^^fi^^^^.9lb^^ ^\ bo 
epeci f ied in the feeture J ijit of . tf*e>, f^jp|J „ •^Jj^s 1 1^».. . 

ATTRibutive adjectives are DECUned, which Meane that they .takp endings, 
which are specified for esse, gender and nuaber of the ajjin^noan,^ the, noun 
group. Adjectives are also sold to have a STRONG, ,_VEHU or HIXED declension, 
depending on the deterainer in the newn jfoup^. STRW|*- elective endjnge,. are 
ueed when an adjective is in f irst position In th^noun^ «;ouj» ?i for exaaple, 
"gutfljp. Help" ("good Mine") or when th| ajdjective paraUel^^anothar^ strongly 
declined adject.Jy|L ("f*'rPf« lebhafter JJeln* [/ *f Ina^JH^^^lijeM.. ifflf 
endings appear on adjectives fol lowlr^ajii deterainer ujtjt strong i a*dlngji 
("der gute. Wein" / "the good Mine"), end NIXED andjnge *o,U I ** ! nd t f l n , 1 ^ 
determiners, since these bag or say not carry case and gender inf creation! 
"einen gut in UeinVJaccusative), but "iin,gu|t >Mtn^ (noeinative). 

I should note thai ^celling word,* that fulfill fjp^ne UJ, (2), and (4) 
adjectives is in agreeaent with ths i.er^lyela.pf, GMiaJe^i , aMtjouph a^ adverb 
category is apre tightly constrained Jhan hi *• .,„_.....>,,.,. 

2.3.2 Binders 

Examples of the class BINDER are dill. DJSMMb f^.jJiQJl, «tc. Binders 

appear in the fl^t-«^n t I tc|^^ > ^*^-^ a % JiW*<!r^ p W!* t,ong 
(the equivalent of if/then) are not harKflfJ In «^ fe ^^ 

need a special tag in the dictionary. 

2.3.3 Cardinals ■-..■-. .fc*^---:- ■ - ^'; 
CardinaU.CNUn), ^f JUflii Jtfji. etc., oc^ ^^n^^g/ t g^»s. enp; right 

now the parser astuaes that i^ey wiJJ no*,^ de^lir^, v 
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2.3.# Cohjiinotlons 

T^'tiboHtf'rnM^tetunetfMM - yM, attttn. Qdm. aaodflcn a*d dann - are 
not given egMeette -features; but Instead -ia^ataffai* using a specie I 
function. In the suttee, the parsing of conjunctions it dono using a prograa 
taken froa Wihograd , s eyetew with only ■light aodff loition. Saa Ulnograd (39, 
p. 3fc ff.). 

2.3.5 tfefJr^inera 

Qeteriihers (OET) are a fairly diverse class, with the coaswn property 
that they can, and usually do, occupy W ffrtt petition in a noun group. 
Thoee with the feature DIP -the definite* jteL. die., djt, StU. ate - carry 
specification of gander, nuaber, and case. The indefinites UNOEr*) tin and 
katn are dlatrhguis>iad uith respect to nuaber (SIMS or PLUR) and take endings 
that indicate gender and case. : rtoses*lve'lisl*Wfn^ I Ike |aia, djjjn. 

and flein, spaci fy person, gender, and nuaber froa their associated pronouns, 
then take ending* for gender, nuaber, and case of the aain noun in tha noun 
group. Oetereinere that take •der" ending* are called deaonstrativs 
adjectives here (OEHAOJ) and include djia-, JjQ-, ate. There are also tha 
interrogative deterafnsrs (INTEfi) yajfib- and ygiaatl* Ualfitl- '• declined for 
gender, number and case, while iaMMm *••»• no endings and carries no such 
specification. 

2.3.6 Interjection* 

The a* words, like flbjc. «hd it, say Sppser between the subject and verb in 
regular or inverted order clauses. 

2.3.7 Nouns 

Nouns aay be either MASS, COUNT, or proper (PROPN), which are relevant to 
whether a determiner aay be used. They My be STTONG or UEAK depending on 
what endings are taken. For STRONG nouns, the plural and genitive endings are 
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given in the definition. Tbt paniti%t» ending I* nece *ear s g ,hfra.,#ecaMoe there 
ie not enough Morphological inforeatlop ii|i$»» Jff«t n co«ti^ r ^ ^iw •*• 
To do this reliably, INPUT would have to y tfKe,,|ntfl ^ eoifWrftion lip w«*er of 
ay liable* in the word and the nature of ite tepiin«|;|i|tfp,t. 
2.3-8 Partlcijiles ,. , ...,,, .-^^-ly 

The PARTicipla class consist* of : p»it, ! ||ei't|<?4pioi JRASTR) - Mhe 
aaachuo—n (fiuuft) - and pref^t perticlpiM M^^» . ' »« KtaiJUWOd 
(flUiMloa). Participles are not entered in the dictlof^pjy^awM^IMlli, *>«* th# 
underlying yerbf are. It is the respoBs|pUity af ^ eor poo logy prograa, 
discussed below, to tranffore a verp djfln||Jipn : Into feeder a^ariticJplf,. If 
a participle is OECynepl, then 1 1 , tekeej^, safe endings <§£ an adj.ctive. 
2.3.9 Prepositions ?s - -, 

Happily, this Is a sipplf part of spaae^.sygtactleally. , Prepoeltlone 
are either pre- fixed (PRE) as in "^HaijeB^\p£~p»rt-f Jk^^^ 
Haus gMfjjftgex," The esse* they govern a|e^a|tlp?c^tl*a (IfcU, eccujaHve 
(ACC), genitive (GEN) or pixed WIXED). i.p^either df^Yt^or a?CM?atlve» 
2.3.19 Pronouns i , K 

As in the case of determiners, there »• a nueber of varieties of 
pronoun* (PRON). Wp*t copfpn are the PpraowiJ pronoupa„ IPpS) . which ie 
actually a poor choice of terminology here,, ,elpce Jn^TjMWr they^ .&• 
frequently used to refer to Jnanlaate ebjecje, Pfrspnf J 8 pr,ppoupa are 
•pacified for person, case, nueber and gender (if tj^Fp^p^ejin), T°«ff !• 
also a group of pereonal pronoune disttnouiabed by^il^ f eatwr^RELBQO. REU100 
pronoun* I i ko dj*. UU. and M (when dt i # cpsf p i n d ej i^t^a^eppsi 1 1 on) May 
refer to whole relatione or atateMenta^rettoer, than Jue* tp other noun groups. 
The interrogative prpnouna (INTER) Jikf, yejvend UU ere aeaoif ied.for caae. 
uh lie the possess ivp pronouns (POSS) ^J^9l^^ J !^ l l^l0 l ^\^JfW^' 
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nuaiwws ind|i'i«i'ifJN|rineiin part, then end Inge (nd teats gender, case, 
■ and wn&vr In ^ix*omm**r0\§ ■ i#i' t^"--' , i«*B« : ; gr«ii^'^ f liyirt^^<f i ^* , tinioMrMr 
MB8TWBTI are tttaai f i fee ejfl end ieaairi. *»* relit i ve prenduhe WEL ) - djx, 
Hit* daa. iiej^* ; ;#j^ajjirii|i^ 

and carry gender, nuafcer, and case infereatioh, FinaHft/ QEFlni te and 
IMDEFinlts dste™ir*^r eag also be used as prs r wuns , ea they carry the 
features PWN Q8N (for aaaoneiratlve pranauhJ. 

2.3.11 Quantifiers ■ ' - 

Thi e*-'Ciaaa' "la died tor - aards'-^a^alajlajar - in f iret poaHten^of the ridun 
group but can eoeKtat Mfth deteralhere, e.y. "Hi ara fteneehert* t*i&. the 
people** and "nlftgt •'« fiaahalaaod" feua a aeohifopod*) . 

2.3.12 Separable Prof i*ee 

■■'fn genera i .rinsee** coii^rde Mlth tlw ciaaa of prepoattlofli/and their 
usage eorraaponda to the English particle, aa In Tlf^eall thpe up..* 
Separable prefixes OCPPft) appear In the dictionary aa asperate uords in terae 
of syntactic feature*, but they are not glveh ^alaihtiti daftnttiola Independent 
of their associated verb. 

2.3.13 Verbs 

Verbs cose in ' ine fetJeuiny varieties! First, they af either aaln verbs 
(fIVB), auKlliarles (AUX), or eodsls ttfXWtt.). The auKmariet are bflfiaji, Hill, 
and yj£dM and nodal* ere thbss thst rsqulr* W ea1n*vefb Inflnt 1 1 ve. Models 
include itftajt^, : JuditJb soaean. etc. 

If m-v^^' W tt-ff^-^'i^'^PUimi-^W^ or^iffirrttr- 4SS s PR' : 'a*ana that 
the verb fore happens to have its s ep a r a b le tti »H x attached (ajbMQflia) t idSi le 
-SirWaedh* that the sepsr stale prefix- Is t oimhsr < else r iQht no«4 ■ (hJnfll 
davon y aj^y ; : #tMN vergers 'ill^h*^ specifies 

uhethw ths verb is reou+er, in-sgulsr, etkad, er takst an oaleut in tha 
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•econd and thjrd parson lingular ...JPffi. lfR^tWin, •nd.UtJU r»^P«?tJve{y). 
Verbs with lrtgppr*»J9 ir*f«»i» ^ke£ W J#ej^,ar* £ e^ This 

inforaation is for the bjoefit .. jq^.^tl^iV^^l^cp^GfllM^V*- * 

Another sort of JnfcyiMitiojy.* tha,,typa of object that ^.virt *»U *•*•. 

which uilJ determine the tr#naJy| 1*1*1* <M. ,#W tyjmili^s-NP^J^i 1 # **•*"« 
features like -Relive, ^.tm^m^^^ #P^W%t)rV<^^^«*»*^ t,c 
case for noun groups, and .|ythVv«j|^M^#^a^^ |^^l^«. ? Fpr sxaeple, A40 
is the feature of a verb, that .takjo : an ^fpcj^tj^and; MfSfe 1 *^ ,not 
neceeear I ly in thie order), I f a fceaoei Hon J^rea^jf ej^^yerb,.. than 
itajranpitiyil^ la ftl,vjin »s "P", Hhlle If Jt)£ .fJbgf to|J^I ijra^iirirtj "f" *• 
used (for rw particulaf raaaonj "A" ; ,jf«; rl^^JM'Tft^vW^^ V 0^ 
intransitive varb is still Barked 'ir.^'^J^'^^^P^'^^J^ 
dative pronouns used reflexively. "Z" is usad \f aT#n|^*tad 5 >ur r c4a^aa is 
required, as in: 

Auch dar Trlchter nilegt i**^' HlllltT IWifllinrltfTO: -,.- 
At tha saaa tiee, tha aapulatory funnel usual > t% ^HsMlJTA^ JPM$ M a lf r ' 
I hava not usad categories I ikf "require^ locatjo^jjnee Jvia popa^to be 
part of tha broader phanpaanen of se^^^ tna 

verb, which is handled by the jseajfltic c^s^ne/it,. Un J^ls caae^ a selection 
reetrlction Mould be used.) A verb with r a v re^ired ;Jw^on tJ .then^ is marked 
uith E. P, A*P, etc. as relevant. For ^ : c©aj||etej la^^ tr§neJtivity 
types used in the ey»*ee, see Appendix A... 

A natHprk liMFIgxra 2*2 glutea ^^^a^dpfer^ti^n a/ .a^iictyns^ ppaall^f In a 
language, but doee not specify how i«ifa^^ to 

this descriptiys iiif^rjiatlon* One,spiu| ! |fln %>Vj^pri&^,pm*Q\mn by 
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tHnograd, and tot the interpretive greaser, I followed his approach o^rf t« 
cloesly. HI riogritf's approach f* not ths only Way; e^'eotiresf, to uM the 
inf creation in the ' networKe to Quids psrtlng, end I ui 1 1 cot* back to thle 
issue in section 2.7.2. Tha Csraan ; imaf#a¥f*i graeaar nil written using 
PflOGRWtWR, a U» ^aau a dd a y language docu s sn ted In lMinogrsd,3«). In brief, 
PfiOaWtlAR provide* feci U Ns* for constructing, lhepoctlng, and aanipuiating 
a parse tree. Tha baelc function ia tPHilL *ifaitiraa>K which inapacta tha 
input santenca and tries to add the 'specWieo* : '***je' ! $b the tree. A group of 
feature-axan in !ng function* (NQ, HQ. CO, ISO, all"! allow inspection Of tha 
eantenca or trea for particular features, aiwi ai*s%*lf group (aaong thea, F, 
FQ. R, RQ, At»-F4iQut, and HEnOVE-F-*»E) a Mow changes to be aade In tha 
features of a node. For Moving around fjhs' parse tree there ie the function 
WOVfe-PT, while flOVE-fTU can of Weed t© sovs around 'ttaaf Vnput sentance. If a 
pares turns out to be incorrect, the backup functions POP ahdrFOPtO aay be 
used to reaove pef*t*eufer neaW^f^ ie 

the branch stateaanti 

It Ufunctlon Or variable^ <leb«l-l> <lab*l-2> <lebsl-3>) 
Control goes to label-l if the function or variable evaluates to non-nil, to 
label -2 if the valus is nil, and to labs I -3 if the end of the eentence hae 
been reached. Label -3 is optional. 

Sinca tfKttfcfflftn' ua* designed to handlii I thgl f eV eoac changes and 
ex tensions were necessary for processing Geraen. these involved the addition 
of another syntactic Isvsl, that of the phrass, tha expansion of tha apparatus 
that assigns feature lists to words and nodes, end a eechenisa for handling 
partial iflforaatlon as ft is aecuauia%d^ -fha *riirt! of* this aacftfbn wi ! I 
discuss these additions to WKI«Wt*Jt. 

*e 1 n W riogratf* a sys tee, the actual parsing fa don* by the ayhtact 1 c 
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specialists called by tha PARSE function, S^tact)£ epec la lists corresppnd to 
units, so ws have CLAUSE, NG, ADJG, and PREPS. Irj adjiyiont, the Gerea* pareer 
contains routines that handle phrases^ tha# J a* co^ituante ^ that of ten 
appear together and for which t^oMtiniigtion sake* ., aajpea,, ^|t which do not 
enjoy the theoretical status ota group, o^|AMSS. t ^Thjjwajc 41* f*r*nce 
between the tref taant of groupe and phreeet 4M*W' ; (|/>oJfr ,L». not eetabJ iehed 
for a phrase .when its fouUne .if entered. Jgu.t.a jr«#» §l#|Uf has i ts ,4**» fode 
narked with its nana and features. Phrase prograas are used to handle things 
like verbs and objects of verbs. The ceneonents of these phrases are 
interrelated, so Me Mould like to hand le a |hea ^ogathaj^ T^ ■*• " ot fMU 
un i ts of the gr aaaar , however , a ince the coaponants ,do, not have to reaa i n 
contiguoua under a 1 1 circumstances. For exes* la, \n £sc*an f^a^raf t object 
nag appejr before the finite verb, MhlJe gi| jW»*f*t «MJ«** of the eene 
sentence cones after the verb, No changes had, to b^,|fdf ) in PROGRAfip^ in 
order to Mri te phrase routines, , since Jheu, ^ ace trea$e4 1 ,1 |kf ordinary 
aubroutinea. Phrase prograas lookjjke g^pufi^Drog^a^s.e^cfpt that they do not 
use the reserved tags RETURN and, FAIL. Modifying PRflGRAHOAR to perajt uaa of 
these Mould not have been particulery difficult, f^^for uniforalty the change 

should probably have been aada. 

The case-gender-nuaber conbinatorics of Geraan (described below in the 
Morphology section) nade it necessary to ewiteh froa 8 single list of possible 
features associated uith each input word in the original PR0GRAN1AR to 
Multiple feature lists. Each possible usage of a moNK : then, is expreeeed ae 
a different feature list, to handle thess, a feu staple Change* were nade in 
PROGRAtiilAR. First, most functions now handls a lief of feature* Mltn an 
iapliad and. linking ths entries. Thus (NO flASC SlNGI checks to see If eone 
feature list associated with the next Mord has both ''ttfac and SING propertiee. 
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It would fail, for exeaple, \i (NOUN rtASC rttffl and flttJN FEN SttO mti the 
feature eete in question. 

Two other additions to PROGRAmW, FIX and MDNEX, uer* also Motivated by 
the pro! I feration et case, nuaber, a^ ]|Miitr dottaiii I riles. They are used 
for dealing with partial inf or nation and are d I seuwsrf further In section 
2.6.2. To dleeuae the Interpretive process, let ue eta** with the morphology 
prograa* since this is the first stop aaao by en inpot sent e n ce . 

2.5 Morphology 
2.S.1 Analyjing horpholdgical Tags 

Given a word, tns job of ths sorpholoou cosponent Is to determine it* 
root and' then attlr^a- n*rdrsynts*ftr^ 

associated with the Hoot and endings, horphotogg finds tt§ fhforl*tiott in the 
German dictionary, which contains both roots and endings. Syntactic 
information for a root is listed under the Keyword flfttUntS, where there ie 
one feature eat for each pose! bis usage. '' " Ihisr'tW DCtll tyidjj, which might 
appear 1 in contexts lit* »dle brsltsn ■$&mml l& 'f*t^H#dad atr%ata*J ! ' and'' "Ola 

Chrbaatephoren ore Hen ieJch ana," t*Thm/&re»i$epB#a^ 

.,,.. .. . , ,. vj '.. .... i 

syntactic part of the dictionary entry I oefci MImm 

fDEFS 'Wstlt" : 're^t8iCr-'t- 

(V0%1 HE£ -SEPPR AUS) 
(AOJ ATf* COflP 9«*» ) 

In other words, praJLi asy be either a regular verb that has the sep ar able 
prefix gya,, or an attributive (that is,- prenoalna|) adjective that can fern a 
comparative or a superlative. Actually, such dictionary entries contain aore 
i nf or eat ion, and the coapl ete spec i f i cat i on nay be found in Append i x A. 
Morphology finds its ending inforeation as a •!•* of feature eete 
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indexed by ENDING. and then t^p^lJi^jtm^^^^^^nim^^^^m'^* »° 

there it no chance of sonfusJflg the sndlrxi^ «U£ $*, uord ft* ^ThU, 

distinction it eore for the benefit of the ueer, alnca.^j^lfictndje* ie 

enough to keep the systea on the right trectu) Part ef the inforaatlon 

associated Hith >ES looks likes p ??? .^" 

. . (DEFQ ?£S ENDING ( . , ,,.. ■ ^-.-v 

iPRON 

(PRO** ABSTRACT* GEN SLNQ))^ 
(NOUN 

(ADJ« Atfit tta STRONG NEUT NOH SING) 

(AOJ* AlJRe .J0E^STimtl4Pffi}Anfm>)>,^ : - ■- 

There are tub aaln routines in sorpho logy, INPUT end TRY. 



2.5.2 The Routine INPUT 

INPUT is the Geraan equivalent of Uihograd*s iorphoVogy analyzer. It 
starts at the end of a word, aaking successive cute until all ending 
possibilities have been tried. Ulth Gerlln^ue 'get involved ?n coepound 
endings, for exaaple uhen the present j^tlcV^^'*^'^^''^^ 1 ^ 1 '* •■ 
In verarileeeende (causing) - *fltaallli + tift *~* Inliddi tTon. there are' eo*e 
prof ikes to consider, as is ths cats Mitt ga. in the paat participle, and there 
are also soae infixes, i.e. go. and in, at in 

ausgearbeitet tubrked out) - aus + ge + arbett ♦ e^ ^ 

anzuschauen (to look at) - an + zu + schausn 
The Vnput prograa handles at rrsgularly occurr^i^a^^ and 

a led takes care of soae non-standard f aliwtloiw^Tiila^ tfw'aldTlfroii of an uelaut 
to the verb laaaen in third person singular (iin±). Caeee that are not 
handled by INPUT* s ending analyzer appear directly In the dictionary, Theee 
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include the various incarnation* of sain (to h*y. oa*t parUciple* wi th a 
vowel change Mfcs* attlTTmtmn ■ *EriW :? •&* *H*a" '#fth unusual plurals It** 

Sad* (pi. laajDs). 

2.5.3 The Routine TRY 

Ones INPUT think* it ha* a likely tpltt, it *ei^":th|:at]*« an* anding 
Mat off to TRY. Tha firat •t^ i M^ i k^ i dkil^^i§^\f^ffm propoeed root 
does indeed appear in tha dictionary^ -'*ff»eo,' ^f^fofl^ It* syntactic 
feature* and hold on to the*. >to aia? piek Mp fft|<f**t ifooeiatad with tha 
different ending*. It i» a* ; 'tnia? ! 'pai#* : *»ii$^*#^ir*li%otice eea* epochal 
problems associated with Gerean Morphological proceesina.. Since an adjective 
ending nay neve four case** three g*ndera». two nuebar*, and eay be etrong or 
weak, the ceabinatoric* begin to be a prop tee. Aga^Jbia accounting doea not 
even take into consideration verb*, noun*, or participle* apaa of who** 
ending* pay coincide with adjective ending*. (Per. this reason, TRY Rake* a 
firat paa* to deteraine the part* el apeech po**|ble for ^the re© t and then 
look* at the ending* to see of they fera a perai»*lbi* pattern, given the part 
of speech. For exaapie, hreif o j <b£OM) need net be tried a* a verb, einee no 
Gerean verb i* foreed by adding m to a root* Mara, only tha ppeeibllltiee 
for an adjective need be considered, and sine* the endings M*t* are all 
indexed by part* of apeech,.. it i* * slapl* patter to pick out the relevant 
possibi I i ties. 

Haying narrowed the field toaawhat, the n*xt step it to call the routine 
MERGE, MERGE eove* f through the ending* list*, *tartlng with the lift* for the 
last ending and working back to the list for th* root. Its Job is to cpapound 
information and eMalnate bad combinations. To do this, MERGE need* to know 
which part of the ending poaeibiiity Met )• required information and which ia 
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new. In the systee, stare denote Inferaeflen the* «Ja*<** preden* In the 
preceding featur* list and unaerlwd fsatursi ««• eteely added. AaawaKOaple 
of the notching and compounding dene InWRGfcV ff*d MttHtftt^ltfioa^ »**eh 
i« a participle: 

liegt (VERB IRRNYB) 

-end* (VETO* FRESP) 

-est (PART PRESP* STRUNG NEU? NB1 SING) 

(PART PRESPo STRONG NEUT ACC SING) 

Hatching on ateWed e*as*ia, andaddlno the unatarre* Inforefltfon, «a geti 

twwT presp ma&mvm mwtmMm) 

(PART PRESP STRONG NEUT ACC SING IRR NVB) 

Thooo are slapllflsd verelbne of the two feature eatr'that/am do aeeoclated 
uf th the word Hiaondes . A special action isartafceft h#% by tha routine TRY 
because ue are dealing with a cfcangs In p*rof speech. -For thia epecla* 
case, tho * eature VERB was removed aftir PART m* added, to proven* tho port 
of Speech designation froo betteaing sebiguous lh^t*#Mnirt feature set*. Thlo 
deletion To done by avslepte choc* In TRY, e4fwo aba** Of fcpbech Chang* 
occur* in only » feu cases. 

Each cal* of TRY by INPUT aay add feature dote to tha syntactic fearture 
Met, so that In fact a word aay be divlded^%ever»4 «eye. To fceep* tho 
possibilities etralQht, a root Mat la also constructed, with an entry that 
correspond* to each feature set. s v , r 

2.5.4 Spec fftl Feature* of INPUT 

If INPUT Is not suooeeeftff with Its initial ontring anal yaia. It Nioke 
for a coapound word. CoaiHHind **^ a*e^ HvGeraw often Mhare 

Eng 1 1 eh uou I d uee a c I ee*W I or plus a noun. German usee uj coapound, Th* 



* » , 



42 

•ethod used is » bruts farce one..,, Tjte,wei?dr|a'iipHt io*|s s tp>.#arta and ; oach 
of these is fed t° • reeureive ©all of JMFMli»>< I* both sections turn out to bo 
words, we construct a featur» v *:**t* If not, than t*o novo ovor a tottor, stake 
another split, and try f gain. All sorts of refl nsa s n ts, e.g. on fcy oak log 
splits between syllables, are poetlbl I itlsi here, but nothing I Iks this has 
boon done. Right now, the eystsa handles only opsf^indf asde f roe, two 
components, but the code cxujd bs g|p^aM«*d fairly s*«Hy. Tho coapound 
analyzer will accept now + noun pairs I Iks phrnMtnnh^ fft*?* a I ("play of tha 
ohro«atophor»s"U verb -- "-- r mt "' ' ]Yr 1 tlUfrtnTMB . "" '.ftlt TB*t" * "•"* 
pronoun + infinitive used as a noun pairs i )K% iSifliUMifilXlBB ("finding each 
other"). •■■■-.".■' 

Anpthor Jfoa|ure,p| INPUT, J • itp Jifjdjjno^of %f Initive M erbs uaad.ae 
nouna ft I f a word 4# syaJ uatsd as an Inj js^gj vs v^^ a.faf lg rout Ano . | a 
ca I |eat .to : add. ftaturse and •esantice for ao Jnfinitivs used as rnpun. .Note 
that in f nor Ml t«*t, both y*i» and noun would not be possible at the sane 
t i »*, si nco, noun* upuj* be cap J t*l JJP% ? , $fe 3m*Wk Mtip IW» f ST .JffPHf • 
however, had only upper cass, and It ■■ssa d that any, specjol convent i one for 
nouns would bs a burden on the ussr. For thKi^ejpw^fiouhjp; are not 
distinguished froa tha root of tho Gersen input, and ths systea Juat worke a 
I ittle harder to pick thea out. 

2.5.5 An Alternative 

The aorphology component works well, with only one reel hitcht words 
ending in m take a considerabls length Qt tif»% *«an i*Mh #« apocial pjiaaes 
•ode to cut coabinatorics, This aay bo an indication that aorphology and 
syntax shou+d not be « two~pa»* proposition, but rather that Jha otate of tha 
parae should bs used to I isit possible ■erohological ana lye is in the next 
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word. Thus if we have "einee aljejv(1«jin" ("«i ftW JMP% M$l** MPdvtftl the 
tlee the alnas has been parsed we have gathered enough \£ifff^t^,0w& :i ilSm 
to Melt it to two possibilities! either (ADJ UEaK DAT SING flaSC) or (ADJ 

UEAK DAT SING NEUT). Therefore, to soJ,^aor^ypM*# J *fl , ^RfW c j 9 ^"**' 
ms eight distinguish two IsvsJji of eoriiholp^j^ifa^a^^ ^Jha.hig^ level 
features could be assigned ins prm*W$fcfiffak M JlrJ*f AB8W*»IW'-- At 
that point we eight Just Jtpcjf* •#»» rW^^'lW-W^l'^ a 
per* i ssi b I e adject I ye ; enoMng (Ajy O^l, J^j$/f.,Jik IfT^!^ P^ a****' • 
second wptoJog t ica| oaas could jcha^ ^ paf ,M LJ^^PIfe^f!#e|:tlvo 
••*«b»tfd thp correct : caae* ;J endejv f m*^, ^ ^ll% r ^V *M»m, 
detereiners and pthjr adjectlvsi l Jjp.. ttiaj, founu^^%. m |l^^»yjh J99W^« * th * 
..cjfbj^torics^ of s ,adJec.tJve^.e^i|jgs.^|ed on^be te^le^whe* l^.**^ 
absolutely necessary. M Ttiia ^ aDoroachjo^J^^fct toe-,jttif^|JuJ,t ^ ljplfaje*, *** 
it would involve considerable P**<C* tlfP^l ^ffW^I^^'^^ it. 

2.G Th^ Qoeratipn^f Jha c ||^ r , l . £v , :; 
The interpretive grseear operattt m tJf^^^f i *fT#? IfM^*^ 
lists output by the ^eorpholoo^ routines. J&gip Mm* ,% c ^ ruc ^ par ** 
tree, and at any given tiae the JMMfT ^HU* 'Pl-f^lS 1 * fl k "^. "* PfT 8 *' 
If syntactic aabiguitias Isad the graeaar astray, special backup routines are 
cal led to find the difficulty and est ths graeaar onto another, hopefully "ore 
successful, path. It does not sees prpfi } taj>JfLjp djspu« .the, jnjarpfe|i¥e 
gramaar in detail, sines its ^^^J^ff.^j^J^y^^^pf^f^f*** 
Eng I i eh graaear. *•*»»«• parss Bay be ^^Jn^jkf/fffnd)*^, fnd y reaajprf 
wishing further inforeation on the approach to jparejng. shouJd consult Ulnograd 
(39, chap. 5) and, for details, Rubin (32), wtat do seoe to be worth 
discussing are the places where Geraan praaanta apeci a l ? pr.pbJa|M^ or,, wharf a 
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different Ifcprtfcch ; iiea men, to *hls socfidh iih I lii devoted to an assort sent 

o* apee Vat iftjjrttjt'. 

2.6.1 The Hvfse of 'the te^'iSrbtfp ' 

*s ^'s*tttfo1eft$hNlafrYy; W#ir#tf^graeBlir *hWepec1al prograes for 
classes, ' noun groups, pl4fpO$ t fo% ###11 and adJsctTvs groups, There is no 
vert) grb^'iMiM*/ W^e ft % titife tffeWWfrW 'ifneteaft.' The demotion 
of the verb gntup *bi feto* Hudson '<&», ike argues 'tHaft one criterion for a 
group la N^it VH components lust oa conHeuoO*. "* ¥h l%t1 sti, of eouree, varba 
do stick 1®ge%her a gbttf par* of the tie*. 'Hid Wis 'easier to aske a caaa for 
tHa ! ai<T*tai*a of • v«rb ijroup. ''in ; %ereanf ^aW; verbs arl separated quite 
frequently, fdr %*e*pte, ffi WjdaT IfiMefJNeWtiBlsit* ^^ 
i*W'im--&k uo aarirar,* Wt#ifTg, -Ran^t aafliar tit oa^). In fact. 
Whenever 1?ffere'ere %eris othar tf><h thi f fhil. "V#-b tn a aafor clause, German 
word order requires that these other Varai go to the and. Giving verbs a 
phrase instead of a g^oup status boat 'flat prevent the graaaer froa developing 
the special felitldhtfiipi \hat occur ostSson ^«ree l*n a sentence. It ia the 
clause program, liPMavar, Instead «a% i verb Irdup prbr a*, that ia responsible 
for developing these relatione or calling Its in Mantle routines to 

16 so. 

2. 6. t ma I Ihg MHl altnforilt Ion 

The oKlglhalfttiGtoftW ceaet sou i peed wTth three aeehanleae for 
recording inforaation^ac^awtal^ % His Isur^oTe parssri^irst. there 1 is 
the construction of the parse tree itself. Second, there are routines to add 
features to e parse-nbos, and third* ue can eeV and access variable-like 
registers ttabcleted with a node. Actually, there )a a fourth aachanlaa. 
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since the control structure itself is 8 way of holding onto distinctions 
ui thin a routine. None of these aechanisas. houever f$ a^ t tejN|ticf lly. 
distinguish bet usen inferaajipn ttja^ : i%in s safa ..saj^fraj|§stary end 
inforeation on uhich further decisions»cen be baaed. By fjj aaa s n tary or 
partial inforaation, I eean Inforaatlon about elialcaii^na^ haMebaan jiarro«ad f 
but not fully dec i dad. , for sKaapJo. ihe : de^ni^a fj^'B4i !4 ((B^Jf t a»f|oi«R, J froup 
iePaTive, SINGulsr, and either nAXulleo^ei* W8*£ m Houseful to Know 
that FEmnina haa-baen ali*ina4^fro* : ^ but 

this is only .partial inloraaiioB, since ftfe&ie point *he* parser still sees 
dun asfciouous, A parsing euetee should hava **jyMa a>a4 »|% thla-partial 
i nf oraat i on aasl ly»~ bojh to dselonate thp indorsation; Jjyp^JS* $o **♦ I ; the 
par tor H*>are the PP^al infers*! ion j*« : . . 

Largely in response to ths oe«ier^waoerr«ase coa*Jn*terl«s 4 two new 
feci litis, were added ta PROWnttJWh FJK a nd.WC X. F4X give* us a May of 
handi i up bar tl el 3 i nf aramti on be fere a par e ona aje is constructed* I n the* 
Englieh version of PROGtWtlAR, the only wag to specify the feature* required 
in a word is to ue* the** as -paraaa#er*a* -tewaeltaji -Aeiie*4en,aej|&}of PARSE. 
HHb FIX, ho can e lie! note possible par •a*,©! the ns*t< word a* ft* iff* la*ant 
inforaatlon i s encountered. For exaepJy, jpIX IttSR WK) mtm iny feaHure 
sets that art not bath HASP and SUU* «>deteaaj* ; .efjaW:MaeV ;Si*yar4yj «a 
canteay (FIX OR 1\m NfUTI . uhU#» ok Hti, feature sets that ere not sUher 
t1AS£ or NEUT, Fseture est. that hav» been *N«in***a%*r* put; behind a barker 
so that they ars no longer aocaealbiA, to? Jlsfcfn^p^ilisip^c^ alt^o*^ it is 
oapy to recover ojd,pe«8jlMll*MM»iby;fer*eM^hp»^^ 
eeveral ti see, then, to narrow .,$bt , poaejbH^eatbaJaisfcPARe6 : ; i* finadjei 
cat Jed. One fee iMty that, FIX could hsv« mi aoa* mhr leJMtjaoiMe FIX NOT, a 
way to d4 squall fy feature seta that contain tfce feature gfvon. 



46 

Orra plans uhere FfK Is used frequsntly I* fn dealing ui in verb phrasss. 
Consider the oxeaptej 

fine 1 tedeaa, dereh ofcrt I iche Saughepfe eMfarnt i*^deh elhd, ... 

An eledens, »^»e sucksr« hove *H been rseoved,... 
After parsing tha peat participle yerjlHi, us know that the ne*t uord Mtbi • 
form of win md ttthar an ?NF1nJfive or • fin) t« ver©~. Mb can uce FIX to 
record these facts, and then cheek' to see If tha verb Is fihlts. (Ail bcit 
first and third i^sei* plural of boot 'vera* can bo uhaiblguously dlitinyuishsd 
froa the infiwWve fere.) Sine* It tfrntVeu* the* |M fa a finite fore, it 
should agree with th* eebjact, ** «w can eel 1 ■# f« again with ths psrson and 
nuaber of ^JflDteit' *F!X P3NB PUJft)* Net* *he* tfWadMltfen of FIX doa« not 
give PWJSIWtttR any neu power, si net ws always could have dons ths alia aorta 
of tMnge ay i^fr^var iairi^ FIX 

merely aakaa it easier to sctueutste I nfw act ion about the newt Mord to bo 
pareed* ' aven If tMt Information t« found in »4d*4g eeettsrsd ftrtir of the 
grammar*, "'■'■■■»■;.■.. 

NONEX, for noh-ewelusivs parse, allows the PARK: routines to live with 
ambiguity, at least to a UmWedewtent. When we siafta a cei I like (PARSE ADJ 
OAT SING N0NEX), ms are saying, "flHrtnet* f tatore aat» that dan* t agree with 
the features epecified, but if mere tMkn>o^^faa^##ait'1il left <say, sots 
with different gender*), don't worry right now.' In the prseair, NONEX psrses 
vcb used «rt thin noun groups, so that us mrw^iwt 'farced' to sake case* nuaber, 
and gender distinctions ah adjectives tm 'd***riflii*rs Irilfara the nec e ss ary 
inforwition is in. in ths noun group, ths ambiguity will only rarely persist 
beyond the po\ni where the main noun Is paresd. Cods is built Into the noun 
group special iwt to allow us to go beck end clean up NONEX per ••• (I.e. pick 
the correct feetore 'oett uhen it Is possible to do so. Another place that the 
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noun group special I st usss NONE* is In parsing prc»oyqs. Us say not know the 
gender or nuetoer of a pronoun until jts reftpfwt bep D lpfeq found?,-. (Me «ey also 
not know the case, but that is handled ^bfifcup Instead i see the ns*t 
section) . NONEX allows us to do t£»s ^Vyo^ajn^qj a pronoun, aaybe using 
eubject-verb agreepen^ tp listt tbj p pofs|bAU^Ss,, ijben the efferent Is 
found, a process usual ly rmt done until tN end of ths-.sentsoqe. M« -can o lean 
up the pronoun node. _..-, ■...',. .,„,_. 

FIX and JilONEX, then, saks jt sasisr taJhandle^a^li^M^^P^^iW^ M*V 
do not, however, r.elly cose tp grlpl i^ith soae^pf the dssjjej^olMeojirO^ 
pareing uncertainty, w^ich e ajr r a < Mae^aa^^ ? aaj}|ljB%^7^.. tl , c £ 

2.6.3 Ob jwts of the Verb ,^ .„, ^ r . ;t 

While the eai n ysrb In Engl Ish alsost *U»SU« pp^cisdes U« objeate, \n 
G«raan this is such I sjl| fltten the caw. At <wes ; senUpned,,aj>ovs, whensvw 
thsrs is sore then one verb, in <* ssjor clause, ^4 bi^ thsfinlts wsrb go to 
tha ""^V AH * d *M J ^* •asJt sscondsru,c4^sss eye) e n d .pnder , >>e v *M verbs 
* r " at tt10 • n ^* Thu * ** .* r PSM«!lWy -t,^ .|Q^HftP^J^»^«^fl^1|)n^ya»ftj c$Rje«p#efe«<>f the 
verb with no inkling of what Jhe *al|i t varb rfa v ^ qf&h&i B^ti^t^-tim-- 
s i tuat i op, the order ing of dj fiensjjjf. object* *eu, depend on; <Mhj»ther -.pnsjOf^ both 
of th ?", c ar « pronouns, and I jtfiet sort *f pronouns the« are. The-eilue^oiMs 
cospls* enough, I think, to fores a factoring of the or obi as. To this end, I 
have esde tuo distii^tlona^ one of Jf>J^ 
seeas less satisfying. a ^.., 

Let ."■■■* ir, | cpnsidsr ths ; eors Buccet tfu I ■essure. Object parsing was 
d i v i ded i n to tuo passes, the v YJ%0&\*XTSL rout »»e- la*jajhj$e)et , , ca^her,~,|ban« a 
group, routine) and Jhe 1^0^)^ 
groups, preposition groupf, stc, and tb«a WpRD-OBOp^OeiCK decides whether 
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they ari in the cofrett order. This divitlon of fiber had nothing to do with 
efficiency or * vliton of "what p«op l« tk>." It u«# lore I y an attempt to avoid 
a rat's naet df coaplsx programing. Idsdlty, the search lor objecte and a 
check oh their word order should probably be don* in parallel, but the only 
disadvantage I can we Hf th* may f hatir factored th^"pp&fea is that It eight 
take a I itn« fonoar to reject a bad pared en ordering ground*. 

A second aspect of the graaeer's handling of objects is the design 
deciaion --trm tha^caeiof a noun graty Is a hfgner revef feature than its 
geneW ; or fteibafV TfflH Is, at it stands now, thi noun flroup spec! all at eust 
a I Mays be cat led^ulth a cam epeeMId; : '"tV^1^f^W'eWng We Me that 
at a given point in the parse, noun group case is often predictable. Thia nee 
a bad decision, since if us sake the call (PAftSt M "Aft W if there I e a 
dot rv* noon group i«a*h*r the* w\ ateosef fvW oh*, dtaf noun group special let 
h1 If fei I , never ktww fhf what *rt eHisettr" Tif 'Jftirliitlvtll to peWft NG to 
be called without caw •pacification artd & *havb^ft ^i# r aW neW g/bup. 
raporUng back lNe^sjaa/ the* ^W^nbW^^hW^oWb 8 be^ VdTitivbly eiapte 
cnawyir to aake, although we could ost inWmiibfV tffth poelTbid aabfgui ties 
between noahidtiva ahtf eccusitfl ve :, e%*WWWu1Nir eiilgu'lef V feainlne ^ " ' 
singular, and p Writ*. A* -"ft HrtftaJV i*jeV'l¥ would ^^'sdW a*tra aechaniea 
to hehoHe* thl**, but * steals change ^o afiow na^yW^I'tlbahiW'oV a pars* 
(i.e. "parae aWyttflng fM a nealnirtVva houh group") would bb sufficient. 

A II out ng the nbuW ''d/dtaf apltfilTft lb WdNftii without case 
specification would improve the efficiency of VMMtfi££tfe*«MV several other 
rout ine., but it would not change the" structure Wr'¥Mk&£tlS great ly. Aa 
it stands now, VS®*8BS6CT§' i« c ^caf1ai'bbW ; ^si 1 th^'ilirver1) has been found 
mm uh*h it hat not been. The rout ihT works froli^ snooping list found in th* 
rotifer PtLTiA. H th#i*irtf varrhlMf beeTfouh^ list of 
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the sorts of transitivity |yp»« : .ot^|fe|f,^t^pi9y ? tlA» A >(|^aff ? t|tff d§acribed 
In iKt ion 2.3.13). If np pain yerp has |^pt^|^|i|n e Ho^g^ - ; Jkh« x ,|if| i ni f ,e##iff« . 
progrea can usually tp|( Mother MO erp looking |pr..^tJL*fe ©| r #a|tjvf 5 fHi|«ct 
types and set FILTER to th«M. In southing I iK« • sscondetV clause, FILTER 
is set to alt possibilities, of which :: tbjre ft a£e ,^ra^||y 24 r , K %t» *lH|t thia 
is not at inefficient as U seeps, Bjtpce any sttsept it parsing f.jae.tjtf 
objects will givs us inforaftipq, tbaj ! can, be ussd to update $he^sal|iUltlee 
in FILTER. For ex sap It, FILTER^ is el Mej^p^dB^e^ 

with noun groups cons Idered longer than p|;epo^JU|na t j(l,|| A+A before A+P 
before A before P ). Thus, jf at my pj^t ja^fjiMlf f|»^ • noip,, r e£eja*.;.*pr a 
eecond object, mo can ei ialnaja aH double noun $^ t $IWi f f&£$i'y9lk>u Mlth 
the exception of th« protolss h) th noun group cast specification eeotloned 
above, parsing of verbobjects proceeda ir| an,o^^l|ijn^ J|||^^ v eipc^»t 
fashion, even when the at In verb has np^bsajn^ouiid^ ^,,. % sr 

2.6.4 Liaiting the Parse 

Mhen I sterlad this prpject* I oa fvaly 4hp^|4l^V;*arpMw ,;foff^«euld 
be a aiapier patter than parting Engll ah. ^%t^ejoea^to^t£laitatltc£ Has the 
very thing that gave the aprpholpgy ; :Coapanep>,auc|ijs, headache* ,4hp abundance 
of cats, gender, rnjaher, ano: person distinctions. Heture' lengueaasr ^wever, 
are very f i ne I y ba I enced. The for wn syntactic cooponents sr* carefu Hy 
tagged because word order has a. puch.j^djej, degree off le*ibl M ty». I already 
knpu that objects of tha, verb have ,aor#.^ifead©e fen Gcrpan than -,.^n '&*§)£#£■ 
"Den flaw kenne ich" is perfectly fine. tf£M^ fa'jfa* A\fatf\\u>*lh*'*** 
know I"), even "Dee nar^n.gab Ich as" is not eurtpp.i#in|| f" I -gaMp, 4t 8 toj tt*p 
man," literally. "To the pan gave I i,|"l* , i; #P* ■ &fc$fa&!^* : &P *&k~ ■ 
trouble as it uas being developed wws things I Iks the possibUlty of post- 
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fixed, as MeH •• ^e^»He< prep«U*oo.<roup«, •d}«ctiv«« that take 
preceding hp^ graMpe a»ob<joeter«*aB*e*^ poet-Mxed - 

: gem t 1 «** IhPt^aNlR'sa^ llHptui- jfa-tirbmr at In. •■ £ng)l i eh, 

et©*» ate. fn this tact rem, I would Ilka to run through a Mat of the net hods 
used to fceep the parse*, on tlur'trac*.^ Soao^of ,ttM^.i«««airo»..-«aro i :iiw*« 
■ desperate then ether*. *ut 1 trlaato »fr4»a«i han<M* *h«i In a unlforiii 
: ; s*nner* a#te*0:r-4*»ts«na,-?o*hor vsry J e por tent osc luw lssr lor .' J felt ing tha parse 
Is sesjenUcs, ifetah H Otscuesed in the -next three Chapters. 

The first «#t of «*tho<*» for I 1*HI«b tha paroa c«im built Into 
f J W>aWMH*R#^4t **s easy to ' eo*e erosnd. the perse '*teW..»«nd - to i nterrogete #hp; 
naxtr- rior* %n tho -aon t oncs absut 4t* fsetaree. The CUT variable con bo sot to 
pisiiMMi^arotae if^^^ - 

i*iePs^^aH*h&«*^«^ can "too recorded. 

One ■ way eeoseg* »««j?4eto*** +wre u suetee was to . prevent a 

second par*s ottosart when ^ ^ .ffor exepplp. If 

i»'>c«ll-4fVIB9)&la»:IGCI end^on iPewisoii^jia^^ 

un««cc*»»fM*t« :**t«*l»*«d at this point, the noun group r»duf Jno returns failure 
.«*-i*io«j^-'-png»ropoii^ng>.'- Mthpugh there Js*sosjs^ovorl»ed in setting up a node 
fhereMhe^n^ sods, ehscM at every calling 

poln^^^l<d^i)«'<oMMM^o«e. v;7heroLfors,^tie lii#^d^t ; v«gntoitlc;opeci«l 1st* 
More «iedo weapons ttoio ^^^lmdt^^^Vhl^"'1^^mm4^^:■■' '<*•,.■.'■ : ' 

In addition to the use of se^sags verlabtef, #*• .progripe- do soee look- 
*w*^*Mlm>mMmmt< ;' Um-mi «urs t*jsther- to OS happy yl th thl • opproech 
or nevt. fhJ»r«'aps;||Pthi«g:: ; tnhsront lu wrong #fif looh^alMNId In text process i ng, 
but I suopept that< 4»t should be oorss fultg integrated Into the deelgn of the 
. paroerMee*'*eecttoh*2.7*2i* i «PjhoteM^>:4hp^eee f '/thp^look-ahe^ used uos elnple 
end roasonabty erffeetivs* An exeaple is the prenoelnal clause I Ike "die In 
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Nervensystee verlaufenden Erregungen' ("the exci tat* one that run through the 
nervous system.") If there ie no preeewt-or -'.(Mai ^|»irl4ed#la-anuMhar»'<'.ln^l , M 
sentence, a call for a PBESP or Pd^TP clause wM I use Jook-eheed end *et I 
immediately. If this wore not dons, ue uould probably «nd up parsing the main 
noun of a noun group at a verb object and go to a lot of trouble before it 
became clear that no pronominal clause was pfeeenft* Since pronominal clauses 
can occur in Juet about any fu^J noun gceup, thia could s feu down Ihe ayetem 
considerably. I should note that the Jook-a*eed mechanise here could break 
down in very long and complex sentences, where • lot o| different syntactic 
structures are present. As it i a now, t f ..there ,, t^^ m ae b e r gof <■ the award ^c I aee 
anywhere in the tentsncs - even twenty uoro^ away - the ♦oflk-aheadwi ti #• 
satisfied and a paree will be attempted. This rudimentary look-ahead, then, 
ie not a panacea, but it doss allew the parser to take slept e options for 
simple ssntsnces. 

Tuo other parser- 1 lei ting mechanisms ar* more conventional* Fir at* at 
the beginning of the more compl ice ted reulMiee^ entry conditiene were eel up. 
If the next word'e poss ibis parts of speech do not eateh , these in the entry 
condition, failure ie immediate. A second eeaaurc is to distingufeh three 
levels of noun group: FULL* SUIPLE. and N8-WQ. liken PMEfcl >%m balled for a 
FULL noun group* it ie free to try anythlnfl^ SIMPLE neu» groups exclude 
rankehlfted noun groups IRSNG - like "Ich weies, djeji «m 4iafaEL iti" / "I know 
that it xm. ltUft«"J ^Inal ly, UMBO rwunigrjaue^cajeea* be R9N0 and mey not have 
rankshif ted qualifiers. Verb objects in eejor clauaee are pereed with FULL, 
while objects of prepositions, im,:mme\m*i-.^'im*&M&$Wto£» NQ-8SQ ie 
used prieari ly within pronominal claueee to rpmm/l emejeddtnf. (Actually; we 
night want to allow embedding to pno leveiof PBESP and PASTP c^eueee. Thie 
would be a simple change.^ The ncxing^^ 



effort* m$ I4mm*-*mk. #»*fc **m m\m»\* «r**n tna oaneronty of the 
groaaor* th*iU teoof wh i*i ln » >*»«l* i » hwfcwi n» i ' Klwu toiT, anrf it la fair to 
treat any MboiUMitf •* lit^aloa onU a opart » oortytji iovat a» a pythotogicat 
edoe* 

**ne#J*oY oft «#* ol«* feMo, ther*^» ***.<«* Ifce reutinee uted In the 
•yat— MiH oa adoeriboa\ om» i ^ a ^< w ii ta tt^ <♦» > i fKup ■aohoniaw uaad la 
not the he** ao ooHU * ana. Ii^ootod r a e da r «> are referred to Hit f (19) for 
an en*m§t,e of eo a tap oatK o da and a aa teript ien of "e*»J id t backup. " Ufti to 
the backup rout inea in tat graoiMr ware wr Htan- oofy ad tb« need drone, the* 
• do hand** aMW «*> the ear. oonaon heard** ' itaaV ~ % t*m ; 'im* Wnr'di* ^PtM^peUP. 
!R*J9»^«i»JMil»,m«*^^ The tt**t/VS»-WC*Ur\ I. ' 

epeeMte t# <»« injM w , I to Job it to ^OOP^ vorO torfenHKtee tftdt were 
•****lmaHMMrtm*ta*ih *' ■♦^•a#Hlail- t w tfd twt or Hht : flNhd ayoten 
distinguished between vwor owl louor c«o» letter* 4dAiK^ ; «dHiid i*e ; ; " 
cap Ha I i sod- J« € w w t if te> TfWC BnC«y »a e»*1»ad H tha eat* verb fa parud 
a*** ito oo)ao*» and 1*^h»-^rdti*MJ*Wi peedfoUitle* of the verb do not 
ootch tho oO)oe*» *oood. Thl« *«wt*n» paj^npeeo fren tha troo and aota the 
nocosoary r*gi«to** for anettar dW to the dfflB UBJBiTS routing. ' 

S©SW3&*in8ii»J tapon action if the ob^to e* a addi-clawee novo boon 
poreednut 1* thoeo l» no partfeoifHo or a #Na>t*t#*r*Mf*e -W to fdohdv Yhe 
woot'ine Qtmck* to mm i&\^ M» vm>* *M ******& mv of the object » of 
the vnrto. If oo, it pop* t*» «bja<*«, soto * oot fddtffc o* -4hjo. vorb» and' { 
return. contwoJ :,**, flMaMIK J!ifM*i<i^ 

novHnaw. - SnU- in C niyr ♦poctot-i »o» tw ^oaa Garnen Smarted order *t*uade In 
which the finite won* Jo thooolfi >*a#b. Sine* tha ettb Joe t follow* the verb in 
invar tod order r there is iwthlna to ptwwvt a aub)oet frtm aboortotng ' ' 
proportion group* an* ehjdbie thet Aaionf W 'tho tHOuw. (The oooo is true 



-S3& '*^**>«-''H*^^?^p^ 



53 

of secondary clauses, but this i a hend t sd by TAnKS B ACKU P. ) Soaant ic check i ng 
helps aoae, but often it la not until ths perstr eetuetly dtrhed tav.par,a»p 
objects that it di •coyer* t he t aoea thing ia^rono, As ths ays tee elands now, 
a subject noun group wiM not aistsksnly appropriate a preposition greup (for 
an explanation of this, see the next secUffl* but jit could pie* up a verb 
object, thinking the objsct h«» a pwltivf. It is. the JeAt^ffSUeUr^ACKlF to 
intercede if the pain verb^coi^s.up shpr|,j|| ebjapts, and t« check to see 
whether the subject ishoWlnfl on to eope^hen^it eho^jy*. ; flfeje^i»prob»«*i 
with inverted order c I ousts Hheee flnit| i^e*b r i| r jpifc l^^in v«Pb are the 
saee as for secondary c I sums and «*• haj^dlpd^hy TJMUNSrflAOaP. 

2.7 Problsst sod Observations , 
2.7.1 A Sticky prpblee^ndffartiallol^ionv 

Ons difficulty with using backup feFghituial language *an»i no is th*t, at 
anu given point, i t is not always clear Hbether beefcup should be in i Hated. 
In this section I would lik* to diacuM a »prohUeuhteh occurs in English, but 
which appears in such acre florid *<*• ip-^eeeajp, , The solution proposed <*oee 
depend on backup, but it attempts to einieize the instances uber*-/>l«coirr#ct 
parses will rpaa in undetected. t .:.-. 

Consider a sscondary clause f roe our s^aople paraaraphv 

Da ab«r Manohe in ear Tie fees t s ps gdan Ti*te»lieehs . fen vareehi edenen 
bun ton Farben erstrahlende Lsuchtorgane besitzen, ... 

But since Many deepsea duelling cephalopoda possees light organs that 
shine in various, briyht colors ... -h\v:s- ^.k*- ; 

The structure of this secondary clause is binder-inter jection-eubject -direct 
object-main verb, and the correct division between subject and direct object 
is after Tintsnfisghs . A progree parsing along blindly, however, could 



appropriate* -the aw op aoi 1 l'ih> gffou|» *♦*» wuH I s tt« o » b aei i w FarOon* •• a 
aoafcfcfrter tor the eePjeet l*0|eni£aj&e> %l<— tl«tm» ' : the parser would never 
be the ■ Mi-tor. »twao »w i tr o h»«« ss L o u a wto* ^Cn i f ^«^MMMteet(i>iyoo<l'«MtNtetiv« 

now* group* fononjUi La ity, tee* Me eedli* ; te«iNI H&(^4i&iWtmW i: pM*0rm 

In an sttoept to o»tti4 thi« «ort *^ t*to»t>«n 1^ noun jroop op«cV«t iot 
t* cautroue, andi w h e n e v e r s noun g r o u p couM etoos p b preppy mc>n groups ' or 
etfJeetrWYd groups tmmJSWBMM that bo wtb»(ws to It, It wilt refrain 
fro# doing ao. Seen proposition group*, ote. nttf then -pften eiat up bound to- 
the najor or •tttwrtftneta c*aw«s ned»» Th*« WMi6* : *»MbJe# : f*ie, batataee*'!'^ ie 
.often., the eorree* ene* Since scsd sa ic Soraan ■ a s he s auah frequent use »f the 
pronoalnal clause, *t U tese Hkety that slepfe rotation* will be expreeeed 
with pestnoa4ne* ouoHtior*. ft, on the other hand, tiftp to the wrong 
dee Won* -ee^eto^ltt^rMt^ aa-for' or 

subordinate ot eisar tend to be a^^ ffietr con powew t noun 

groupay even m*akt*¥mto- weefc ■■■»»>€■ coepenent etght be aB** !f t6 ds tore! no 
theft • cleuoo teaeooee thins ox tra, even th o ugh triti#nT^iH : 'te a^'oe 'for' e 
noun gi uup* 

Note the* in the e*aeeie above* aoa**nihg ;: »l» ;; 1i»j*s^ 
Ferben" to- the s e bo i ' flt i i i s t i etotwe uoutd hi iae o t re e t, eHtee W re»J*hj bo longs 
to tho preneeinel eieuee nedifttfe^ t^ $»e*n the sonant ic 

specie*}*! for the cieoM, 9CUUBE, to caJied, it ui H presumably decide that 
this proportion group cannot aedlfg the ^^ofetlon r o preean ted eg the verb 
hoo,t»ofn> and tr sealer control to * backup routine. o oc e uaa of tieo 
neitat'rone, tbio beeteup cede he* no* been wrttten* but its basic teak uobld 
be to detect jurisdictional disputes. Ue already know that the preposition 
group doee not codify the clause* but we have to cheek to see whether the 
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ooapoQ«fitt,on..f>t^|p tidt-H^t *©sc|iM» tJUt4lieM^HFh*M*- , *aVit* Bt } c - . 
inforaation - the entry conditions for differ*** tte ^K Hi end. the wags they 
may terainate - we could sliainats certain elaiee on the proposition group. 
In the exaaple sentence a sieple syntactic check is not enough to decide 
whether the subject or the direct object should got the preposition! eo ue 
uould have to depend on the fuller syntactic check of a parse atteept, with 
the accoapanying seaantic checking. The backup code could cal I for the two 
different parsings, which in effect would allow the semantics prograae to 
direct the parse. This acheee is siaitar to one outlined by Moods for English 
(48); the difference is the heuristic that aeeueae that aodlfiere belong to 
the doe i nan t clause unless proven otherwise. 

2.7.2 Other Approaches to the Prob lea 

Uhen I began writing the interpretive graaaar, the quae t ion I waa asking 
was how to adapt Uinograd's approach to handling Geraan. SHRDLU ie not the 
only way, however, to translate inforaation like that in ays tea ic graaaar 
networks into a pereer. In the tlae since Uinograd's aystea appeared, there 
has bean soae interesting work on English parsing, notably that of Martin (24) 
and of Marcus (23). Hart in* s approach takes the fere of a pareer eiailar in 
spirit to wood's transition networks, but which a I waye expands all possibls 
parses instead of atteept ing to chooae the aeet likely one and backing up. 
Martin contends that aaaantics will Halt the nuaber of poeelbilitiee at any 
given point, so that coabinatoric explosion is not an issue. Marcus propoeee 
a "wait and aee" approach, uith decisions delayed until the necessary 
inforaation is in. The ability to delay parsing decisions would be 
particularly useful for Geraant it will therefore be interesting to eee the 
reeults of this work. The aerit of these approaches ie that they avoid backup 
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with its accompanying problems of programming complexity and the possibility 
of overlooking ambiguous cases. 
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Chapter 3 •— Ordering ' C*PC*p# rt"J*aro 

3.1 Th« Conceptual Structure 

Cone»l der j ng I anguage ae a •*truotur«fc organ! xed Jo convey eeeni ng» " «• 
have discussed th« «truc4ur« p«rt U Ml* «>^«ftt an^fKJM mjH turn to tiw 
quest J on of opening* By the eeening. o|,# *ap>fV> I ajelov it* fiTOMp e* entr* ee I n 
the conceptual structure associated with that word. But this, of couree, 
■eans that Me nou need definitions far "ef^iftf and; "conceptual structure." 
Another word for entries hare is concepts, anaV<^%e^eftB*jaji|optJ«n just how 
concepts ahpuJd be eejjodied, Should h* use rules, procedure*, !■•«»•• . aoae 
coabinetion of these, or soaethJng fJ#» #,^asp|har,? v Should ,tbe .organisation 
of the conceptual atrucJUre he ,* net uMh n% constraint* on, I inkage, w%S 
highly structured hierarchy, or, easln, •oeethino. slf*2 Since no coepooent to 
do underf landing has bean taplsaantsd here, ay answers taJtpej; queetione Mill 
have just enough specification to aet Wets other choices that oust be eede in 
the sue tee. 

For the pr o p o es d deductive co o p oii s n t, aeantng, can toe defined in terae of 
certain syabole, -data structures. :and4tfograea«^ Entries Hi thaiconceptual 
structure are t heor see and assert hone H» a eeductive prograeeing Isnguaoe like 
Planner (181 or Coant>§s#f 126). For the eeet »per%» these aipspfcthar direct I g 
or Indirectly associated with entit iee c«*ied coecep* eepfcers. Within the 
eyetee, the coocept per Hers ere pri«tfci*et% e tear A ien • ir* the deductive dots 
base are bu4 It free thea and r aa aeMNiaad* tbsp re a s are foooociatod w/i tf* thee. 
It is easy to iaag ins another level in the syetea with, say, visual ieages. 
The concept worker* weak! thsn point .-.to vtl«a»ftiBafea^ra»id^1»^oea(duree would be 
available to do operations en thee IMie tespeetlcn, updat^ng# staple 
manipulations I ike rotation* stc. 
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The chapter **■* -toMm* t ,kmvmmwt&^^ l ~mi:*w*W* barker • should 
be ordered* and «he*»t«r & ooet iM« sore dst»l I on the Kind of processing us 
Mould sxpsct fros S <Ndttctlve lu apei wit . ths idsss dlscuwsd hers ere not 
origin. I, eue I *•**«*!• ehepter, end fehepter 5, **«H bs helpful in 
asecr-lto 1 ny soaooe^h* Heswos twH topes mm 'spi ^g i fre j M » ^J$»r per to of' 
the sye+ee ere s^sp ^ m*erec* e«e«e*liiHU WHh S deduct I vs cd ap on oh t * 

3*2 Objects* Relation*, o«d Proper ttos 
A« i n ^ i near adf • *v«t*ft, IMr ^things 1" s *JsT ; leWnf «tt I' I tod represented as 
objects, -M*«*J«to» -#e* pheper**ea* ; i*|^ ? *ha^ ? ^liie#^hree ettegorrps tH 1 1 be 
ussd fro* iWH'OfHipiNi ee distinct h^ ttf thin the conceptual structure, not to 
naae ,*h*«g* in theaieeJueMd; -To err sirtem, - l'«fc'*#r1.l'fng to consider these 
thr^ categories e» prleH I vet, sl*we*H is eWHcuM* to toes up with water- 
tight definitions, A* en sttsept, houewer, **t us s*u thst 4h* p*oc>er«f 
oreat^nq a cwwp^MSt ob^»et stfaot herty >d( f fsi * en tle*es fh*!part;»f the world 
to which it corresponds fros the rest of txperiencs v||4|^ sHpei^ltiiS"'- end ; world 
v 1 eu«d broadly * net just reel tty twit l ea eiiio^p »S j d rl » nc a»» ? «*&* ** we ' ' ) • 
An <ei^{eet v >.i«be^^eM^ to tor, ls«> ss H el w lB S F>V, ^eprsBsn»>> anything 
M j*ued •tsticsMy, *het .-is, -any. p i nno aa mn soii f issi' + d pa on dfrtKy. "AH Object 
»4 ght be a physical oto}sct I *He #H*XK, an .-ftpMpfi WtisdPFMfc# JT^^Y-MAtk, * 
eerrtal 'phe no as n c n 4 t*a ~a PORaWl, sees abstract entffty M*ie ifHUfH; etc. ■ The 
sharp eio* hsrt is ussd to dl«Ut^*eh eofe^ra^ ^,part**f the 

tu»t*»*s intsrnsrcoiweptMSl «twi*tarer fros pl«^n word*, »**i^ ere , parr of 



• -,. A rsla*ii*i bi/Kls©ns or eore entities, tbse*ee* tups of linkage ' 
depending, of cowrsa* #n t^i^P*** »raletie» InvoNed. The ent i ties 'bound 
nay themselves bs relet ions, eay be otojsets or pr eyi ir t ies, or say 'bs* eose 
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combination of the three. One coaeon typo of relation it an events ovonta 
aluaye Involve • specific point or span of t I *• In addition to their other 
participants. Properties are relations uithcwsrouaonte. Tbte *e the only 
way that properties distinguish theeeelvee froa other relations, but the 
distinction sssas to be a useful ons both conceptuelly and for purposes of 
implementation. Properties ara used to describe end eodHy objects and 
relations, as Mall as other properties. To rear aaent this, the ays tea haa a 
special rslation (called, not surprisingly, MMVEerTOPGnW) , uhtch links 
properties with ths concept sodifisd. 

Our three conceptual catsgoriss correspond roughly to the Jobs of 
different syntactic structures: noun groups of ten refer to objectsi clauses, 
preposition groups, and sons ad j set ive groups represent relational and abet 
complement and attribute adjective groups represent properties. This 
correspondence should not be taken to isp I u that objects, relations and 
properties are derivable froa syntactic oatsgoriss. The conceptual structure 
ia an independent levels in fact it It not surprising to see high level 
semantic distinctions reflected in the syntax of at least Engl Ish and Gere an, 
and probably all languages. 

3.3 Sslsction Restrictions 
Considering the huge classes formed by classifying the world into 
objects, properties and relations, it seems I Iksly that particular elements 
will be eore useful and accessible If the el esses ere structured in some uay. 
Since any choice of structure should be influenced by function, let us look at 
the uay the systea Mill want to use Information about the world. Basically, 
thsrs ara two kinds of activitiea that we want happening In the system. One 
is the uss of seaantic sslsction restrictions to eliminate incorrect 
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interpretation*. Ifee other ta^tfei ^-WMMrfutlwi fuW rafcge of roel-Morld 
i nf or aaMen to aai»a^ finals ^ 

Q*tar*inin& proeaia*.i^er enee». o*o» Theas two. fuGc***** aake «Hf ferant ' 
deaandacin, teraaei **»fcii»devo#.t«iBirtedpe<ei^^ ewtUMiMst 

a*eUy4,.ae:#M»^e^ dgMMtte ' ...' 

r»»tr i ct »oc« afwjk # m i uci ^v» i n t«p y>tit twfc , , 

Sal«cUenraa4rlcUona irxtfaste the cwter liaU on what types Of 
concepts way appear in a raiat ion together. . ■ Bar SMsapta, - 
Da» StficK S»lf« i«t lietwvoU. 
T^.bare%seap4 te^^tleaatav ..,. 
la an odd; •entaocostHwauee affection ie;en-attrlB«i*aAO#. hWMinet . eaybe of ; - 
anieal* too, twt it eertalnlyiCa^n^ thing. 

(ThU ignore^, ;;oiree|Stf^ aeTMoH'-ae any 

sort o-f mataplOTicei speech ui II j^ ^See^eectlen'S** for 

a discussion of ae»a of the isauesiniKilvvK^ mo want the 

ae iect i on raatr i et i on associated ui th affect i onata ta«*e felY!fiS<-THING. Such 
a, selection raaArUtion gives ua a criterion -for rejecting bad -paaeeee*- :; tf thai 
semantic compotjent ever finds itself trying to ik>tt jjffactian to aoap in a 
straight scientific text, it will find no possible meanings of liabfvoll that 
ui 1 1 sat i of y the selection raatr ict ion, Semantics Mill than f ai I , which Hi 1 1 
cause the syntactic cosporwnt to try anotnar paraa. ■ ' 

In ^addjt i oa to raj acting amperes p so s ues ne meanings of a word are:- 
acceptaoi e> tha sa*ar»tic component could eJ so uee selection raatr let lone to 
el i mi nata pqeaibia. aaanings o# a-Merd*. This ooaaactoaily happen* but it ia a 
rather tricj^.bueinaae* For ewmple, we might expect that ee Iect ion 
raa tr t i cjt i one, equJdVheAp, «#' o«.t,in die*4ng^«hing J»tMeent 
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(i) Per Flla tfciaLi heute Abend. 

The flit is pla cin g tonlpht. 
(II.) Unter Lien|reiji MitilU dip . ChrojM^pphpraii. > 

The chroaatophores pJiy uhsn at I aula ted by I ight. 

For the Meaning of the verb MJJlUa jn (I) we eight spejpi|y that the flret 
argument of the relation should be ao|w.tj^p^ji^ip^|(^^€W[ATI9N, 
i.e. a file, play, puppet eh<»i, cabarat, ftc ? |t would-not be unreasonable to 
expect the Beeanticaarkerp for theee phenoeena to be cla«tlf ied wooer ,. J . I - 
^THEATRICAL-PRESENTATION. For (ID, however, It if hard to say what the flret 
argument ailpht be. The light* on a .aarquee c^play, lo„thAf sepee, the 
Northern Lights can, chrpaatophores, of course, t «jp ..,- ; aveo found* can. |he 
unifying character I it ic here. seeps to be th^t thfM |h|ngi Jpre ...a systsM, 
Hhoee individual aeober* per fora their particular ac£|ytty (fljeb, eejlt.a 
sound, aove etc.) in an apparently (to the pjrcfiyer)rano^ t order. Thie i* 
obviously not a staple characteristic, ancj pfpbably^ljje b^s^^Mf. can do for a 
ae lection restriction here is soMthing very general I iks #UWC«ETE. ... Since a 
thing that i a a #TH^TRIC%-«fpiTnTipN would also be class! tied under 
•CONCRETE, if Me are given Mntsncs (i), selection restrictions alone ull I not 
be able to tall us whfther the file is running at the theater, or whether ho 
ehould expect to see It flashing on and off. For this reason, el though the 
selection restrictions In .the LSystsa do r sees* Is .thj^eeaintte. isarkere of Fodor 
and Katz (6), they, serve ^different pj^rfflsf^ 

restrict ione are not expected to give a f4iM e acjBoua!t, ^pf even the Major part, 
of the Meaning of a word. Soee thing as siaple as a selection restr Jet Ipn will 
not be able to represent seaantic constraints as to which participants May 
take part in uhich relations with any dsgre* of aeouracy, What a selection 
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restrictiOT can offiW' ia • neg«tiv« t^t««rlo*t tor »4HWfl«t1rlg ifcpoeeible 
i nt«Pi>re tart Ions v l« tM» wa^ th*tf cW«l qu t^# uwfu l flirt- giving feedback to 
the parser, and eeedtii**; but hot a I way*, -f^ei^ii lielpfwl In elieinating 
impossible irtteHrdtatiOn* d? * lejfef** **« 

fibre detail an now •etect+ert^N^Hjtione ere weed will be given in the 
next chapter. DSdttfsW-^feWdcusilort itr ^^Ne^^^'aVelit ''prediei that 
se I eet i on rsttr lethirte cen b#' eeee to work MeH^r^M ly ut th a etr J ct I u 
hterarehle^efrltfiur* \-«* sln#^#sd : naV?Die% usi^lbK sefeetional 
restriction processing, tiitrl the 'fflk£tte#-Ati&ift ^leWis'brf the ''tree the 
gertirat dne 6f cfilt eebbefshipT Note that a single classification eeheae is 
not the only chdleie for ielectlonrsotrictiont. Multiple trees are a 
possibility, as f« Minogrsd. Operations sir a single tree, however, can lie done 
with a minimus df tins and effort, aMIi^lnle^ihe reaeon^for using selection 
restrictions is iff jctincy; it seed* trde ^"fctsVlshoice. 

The se lecti Oft-N^rtetfert tree should be orgenTirstf to el isinate the 
largest nunber d* 4* tdrprdtet f one oddften at powHstev but beyond this goal 
the organ! zatleVrf%*«i'*#«*s to be a Question df be lanes and '* setter of 
taste. GlvSn d constant mi»bsf of esnedpts, lais dspth «ean» a faster search, 
more depth tidah* thit the selection rsdtrfctffons eati le*e 'finer distinctions. 
TH* f«fattire'tH«t^s *»»enti»f hers is the hfererithiceT Structure. Thus, our 
tr#* *lght h*^ a top rtocl* labeled ire 

m^jmm^llftm^m^wm^m&mm,- 'T*r«piftsr*ndaWb'irthe relation subtree 
drd WHitoMV#***r1tt* Here, H&ttoi ftltftttttef- (n^lldef th fog* r lie spat i a I 
l&mimm* rmtedtk df e^ar^sdn Wit iJmrta^ta'YJ, sfc. *1«ITAL-PW)C6SS 
Include* mt#W*»*ue?Ve*#ttonb*, sndpsrceptasl prdeelses. tthie organ iaat ion 
i« IdftJi* frds>*tfi^^ Fngur* &2 SfteuVthi upj>er nodes of the property 

•***rdd, Tldfr tt ' PdTNsftWttW *o^ns>fflg*V : ^»^6W'*n# classes contain 
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•EVENT fflENTAL-PROCESS #STATIC-«LATION 



Figure 3.1 



•MANNER-PROP 



•WWRERTY 



•TETfWAL- 



•SPATIAL-PROP •TEHPOftAL-PF 

JANTIFICATION Ml 



•PERCEPTUAL 



Figure 3.2 



properties that aneMer, ths questions "hoM", "whors" and "ioW*. 
•QUANTIFICATION (teals with extent, which j» sltber nuebsr. for objects, or 
intensity, for relation.. *lOENTITY-INTEGRiTY contains properties I Ike 
uholsnsss and uniqusnsss, Hhi Is ^PERCEPTUAL dsals with any propsrtiss that can 
bs dstsctsd by ths ssnsss. Nots that this property classification is not 
exhaustive, but it floss give an ids* of the organ! iat ion of the tree. A 
largsr section of ths trss ussd in ths syttss appears 4i* Appendix C. 



3.4 Structur Ing ths Uor Id for ths PsdwaMwo Coaponont 
Sines ths sal set ion restrictions can provide only negative interest ion, 
ms uould alio like a finer -grained check to reject ths res* e# the poeeibte 
interpretation*. There should bs spas poottlve criteria, so t hot no can bog in 
to knoH that our choice sokes ssnss. For £h4o ms need ths full potter of a 
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deductive ogataa* The queaHo* <*K&0&.wrk\ti§ on I • hen the data baee and 
theorem .Qt the dooort i ™ »»^*ooohauftd»B orga n fie* . For a etart, 'the 
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Figure 3,4 



clea*if leaHont uee* bu *he.telee*|on rertrtetlottt leak useful. Note, 
however, -that. In, one centewt Figure && *4#«He#ia i ua#*tti ; clo»»i f Icat Ions 
scheee f«r oojocte^ while enother context eight fewer Figure 3.4. Even given 
the »*» generei c»wte*t. I.e. bie-lofy. esgchol^M; cheeistry, or the 
euper^erkeU Mnv Afferent cJM»tf )c»teru teheae* are poetible. U« want eore 
then ju«t a *tno>e ctee*lHcetiwi tehee* for the general data bate, ao the 
etrwctaem' Haad^nt tare** a great* a&treee* ActueHu, i»nc« the"- treat er« not 
coepletoly disjoint, ue can eerge thee Irrt© e tingle lattico structure. Doing 
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this to the two classification* above, «* would get the classification shown 
in Figure 3.5. 

What does the hisrsrchicsl strwctMce^f the deductive data base buy us? 
First, the Motivation for hierarchy is acaeoag pf definition. An ftQCTOPUS ie 
an 0ANIMAL which is a #UVUIG-TfJWG Hhich i» ICQNCREK «hioh Makes it an 
WJBgeCT. Ui thoMt a hierarchical ifrjasjfeap^jaj HPuW ^»^f»clf#lng, for ...... 

exeMple, that octopusfi bjiye soas aechMJ^ ffjo^ea^ti^^ being 

an ffANItlM.) , that they have soas sort of rj^Droduet Ivp »ust«« (proper ^g ...of. 
being a #UVING-THINQ). that they are perceivable;, aUnough pos»»blV aided by 
instruaents (^CONCRETE), and that th«y sr* ent It Iss rather than processes or 
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Figure 3*5 



attribute (preporty o* beir^ «n *QBJ&?t. All this #8a<»d Mi associated with 
^OCTOPUS* awtf exes*,** the seee »©utd have to be duplicated for HMdMK, 

In thinking about ths suite*, I hive titHee 4 eyielf to • single context 

mm * et«i*i- » t*m frf ot fr bJo j ? ** **»»*» *» dtf f tcorttei inherent W- adding to., 
delating; froo, dW r—f tf « W lrrng LWWJWfttUt l Structures, thee* ere ih^drilting 
but ttwj«r prubt e *0\ r**ited m they e*i> ft preoeeeet flit* fisef ning, 
hype*h*s4 tflhy, art* eWf ting fro* Me context to another. A work ing 
' frahe^»^eir#y*tl^ conslderebly greater 

f ie*+bf -l Hy tnarfU* one bomnetf hdre> 

It should be sent) on«d In poeslng the* neither Mererchicel organization 
nor moto" deffnTfliJii mMeefvee tU&mmftl Tafrfy tfff^ieuttiee with 
border I Ine eeode. 3§y. for Instance, Wet IfciroVlfe^fflK fo> 'living thing" ie 
8eif-repr««iie*tom If viruses r opredtt i e thiei p i ivoe but for other reeeeho mo 
are reluctant to ea*«l the* i ivinf tSihed* then thiol** dUe**s only if a 
eingte def f hihg? pt*episr ty Is alt that mWlm'mrmtviMk In fact* it is 
poeeible to set tt|^#p*dijp'Of : 'boroerll'nib categories de#necf by appropriate 
elxturee of prbpeTrt*** free ff*i'ie*)ey'e*toe^l*a. 

The hi eroreilftaii order aldhe, "' h f *i¥ ei 'V ~ n e e ds suppi aeon t trig, since mo 
uouid Hie to be* stole to explicit ly^epresent Other ordered relationships 
between coneepm The" idtfmohai itruituri*' ti "be TMrwJefced Mi i i lead us to a 
gohereJ graph . etHerture with no lllf-feepo, but th* presence of our underlying 
hierarehi«»l structure wi 1 1 cdnotrehi and" control the finiehed result. Since 
the lexicon ie going to be interacting rather actively uith the conceptual 
structure, it eight 1 heip to loot at the situation for words before any 
add it tone to the structure are eade. 
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3.5 The Re let ion of Words to Conoepte 

There art several ways of approichii^ the pp«blt# »f Mord definitions. A 
word might too defined in taraa pf a cl as* and soee WJShor of attributes. This 
eort of formulation is en old one - "eanwe'' end "difference" ere the 
classical teres - and the ttenderd fore of a j&eHoneru detrition shows that 
it is a faailiar onoj an octopus is %nnB|a o^omi»o^«oIIusKs hawing a aoft, 
saciiks body, a large head with a eouth on the wndereupfaee, and eight arais 
covered with euckare." Wars, the phrase -up s*e s !e«i4*ii^ail-#e||paaentai^he.'gemie 
and then the differentiae follow, t A« tfalr fwjee isaMss* *he differentiae 
offer criteria for distinguishing *M*tueep^i*ferej|t mop4# de*inejd wL*h roapect 
to the saee conceptual class. A definition t ,l ike^thi * cpuid J*Ji -wmk rUh m^ •% 
learning (i.e. non-static) structure, and it presupposes a top-down learning 
process. The genus is assused known and the differentiae give distinguishing 
characteristics for ths new node that U added. Un actual practice, the 
differentiae peg be used to pick out a concept which la already Known. In 
this case both aenue and differences are known* and the pnoblea is aerelg 
learning a new word for an « I ready f aa^i,iM^ «0JW*ptr*v 

This genu»-differentla type of dsfinitiw^ight hove its piece in a 
eystee with learning capabiiitiea, althoujh -k% wpuW^ajlthej a fundeeentei 
place, since a staple top-down pneeeea can net adeaju«te(#«Mdal a good part of 
the learning that we »•• people doing. Since. ho i» M o r > J *•« hPJt cone Metering 
issues of learning *t ail, defwilionswiM take a d^Nmpnlv ford)* The 
knowledge data Pass is assunad to he atallc ;U»r4haaanaa jlluit for the words 
defined one or aero nodee (i.e. ooneept ea r tor eJ er»*lways preeeht In the 
conceptual structure. Furthernere, ^the-d*lfa»enttoe^*haeeiiliund3N»'»f 
inforaation that distinguish a ooneept frea ether aeahere of 4te eJaee -are 
also preeuned to be In the dote base el ready*, T herefa r e, a word can be swot 



G8 

easily deffnad b« »»^«r*«>c« to the c«w»pt n<K*» or norfee it represents, and so 
w*?de<oa»r M**#**li^^ Since the 

•y«t«n e4reedu Koewtv etoowt tn» c cwee |> U ro^oeW tod fty; the rwtr I cted nuatoer 
of nerds i 4 ■ noo MW te re, no concepttHH reetnuetw lng -lev pi ft** to be hocaeeary . 

■-■^fte»-f|»)* the* » w« oo* a» ^u p p a *' i»« i* »|» ee nrte r — ewe tlfd* *orda 
are ordered HUth r a e j Hft < t» the uexe e MtU el e ^r u c i^ r e . Mete, hailever, thpt not 
at 4 conceptual t^ie^^icirie^ <*e^ toe r^lected le»t«em«i the* H> v teim concept 
eerkere eiflh* he^f» He«J» oeWne* fer the*. IWs eekee eense H you think 
of teneory - iee«»eeejwBnei- • mwrm ■'twMfKtg^^yvicmimw^iWmm- not- WP9»« to , 
appro* Htate the mm* e* Afferent eheaee of oeler the* caw be Wet Ingulehed 
and raeeeeerceK ;! eie'HMMMHMealBl* •e# , 'ee^eee< i ^ / ^eelilii« i •■ 

w*f^e- •^wi^i^BieBfBpeejr'.-.eepee-! wfJ* ^W""fc' ■ ™*J"*IW*flP* . ** : * eje)ps»jpjsBjiBe,ve»*-. f^5pwKe«iBfiB5w^. 

I4fi- ; meidi<' ; 
' With Hordt In place, it 4wMtaft'«*^M-te?**- «ie©»*»d prbbtea. .The 
oonoeptup* etrwofcir* e^ep^e^etf efeiterltiee ei^dlHerentee between eeebers of 
a, cle»e, but ue have m^mtfp wm^mtm* the eae ti i n of «r i*hb;le cKass 
. or eeepere of 4i fiereot ole^e* «eon» theeeelwee* ' A ph e noee nofi I Hte word 
contr«e4* su«HKi*^t'«|fl^^ '•« to 

order the dan w^aa»<p of a concept nods, but W%mi fliti* one criterion Js 
reJavent, beokkeeplne eooid becoe* anneylng. Am*, " of ; cowad,' . thle doe» not 
even acMreM the «t«44ep profrle* of orderirtfl nodet f»-oe dtfferent cfeeee*. 
The eoluUon propeeed i* the introch»tion -of •« lHi#**#tfc field. The fdr* 
"linguistic f')«M^^»^M>t->'niMtv.'.fp« : e d**e^e^«t*<si# i^tii**a>y, see Robins 
(31, p.iJiJ. On the tk»e4» •#***■*.■ M*<tie*e*o»^4^ should 

be *R .«f*ief4ii»:«*^ or, property eonoept* uelrtg a pfsaperiy- ae a 

cr i tension.. Ue do no* went to think w&imitWtimmmiUrtnt uorde-, eince the 
order ing here* ie eaeantfc and thue etrictig >epea**np should not dee t with the 
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* 
words themselves. 

To represent word contraats, ms could um .-.* and •■« at the extreme ends of 

a field. One special sort of two Beeper f4«^d^epr>eaante ^theieaoe whera tha 

8 laments exhaust tha does In of the f i ■ td' a proper ty, f or susap f e the cose of 

negatives. (There need not ha any middle ground between Iblgl^and %*ot tei g" , 

Mhara tha criterion of sUe applies.) Antonyms presen t an In taraa ting 

s i tuat i on . Take, for axaap I e, "good" and "-bed" . vileVMwana} i n the next few 

sections, I Mill use English examples, since the issweeeHecueeed seem to be 

independent of the language involved.) float people wowJdcaU "good" and "bad" 

opposites, and so their associated concept markers er*al I oted a two mimber 

field along a property Hke AIQRTH. But «Mt happens when we expand the 

context to include "greaft "excellent"* arxt^lauae^ , fi lffeMi raa pect to these, 

"good" and "had" undergo aaubtle ehl ft 4n meaning and ,mm no longer the 

absolutes for. which we construe ttd ths two-aeeberflaJd. Such word paire can 

be called polysystemlc. we therefore need one or aore additional fieide for 

the different f rapes of reference, and the process of inter pre ting a word aay 

entai I deciding which field is relevant for the i par^ot*lar ueegs. The 

implementation details for f ields seem to depend on the wag Knowledge wi 1 1 be 

structured in the systee, so fields will not be considered further here. 

3,7 Synonymity and Connotation 
In teres of the conceptual structure we have been describing, synonymity 
can be defined as the relation that holds batween worde that snare the eame 
associated concept marker, pi van a set of synonyms, th* present structure can 
therefore represent their similarity! the naxt atep ie to look into Maya to 
represent their differences. The first question to tak ia whether there exiet 
any pairs of words that are interchangeable in every caaa. Consider, for 
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example, tlw «op#» ^ftliaft *nd fiii. ihat these ar« eynengas Is exhibited by 
the*#aM^be* a |Mr<t» B «otrt€l » willtna -to uw alt>W to derwt« '.the etandard 
furry anfcoal -with. f#oar tegs and nwiibar e. ■ .^feyytdi*' :hia§*ei s , la a *ora ' formal 
•word end «w Mho to umo* weutd teod to be rtttrVctgd to scientific or poetic 
cofftawct^ ^at i« ,# a" i r itl fsjr b no t *»wb» Jha dleHoMtiio ii t wi ■ fcs the faoi I iar 
one jae U i oon nonnotat ian ana : . Jao a tia tiow* ii r floa a ra i,-' aba would aint to cal I 
dwwie-Uon t)M «**»e*4ual aMnittg-of «;Mwd -«tett »a >tavc b««n taming about 
,.oo4«r - and oowpgt^itin ttw p»wntM i ^ i Ofi o t> oC Utad ifriih ttntin of th« uord 
itself* CofK^toUoo <«l I* *» about the fraaa of Wls i once in which a word is 
beHia used and fltaw» i»fof«MiB« about «w ■ e o o H ei . s «-|iib^ : pNi#*rt l tao foroal or 
t**«one*»„ er*h»i* «»r eeaern, adueeted or tttvidoeated, object We -.or biased, aro 
^ oqmo <w©»c*« that-tuoht enter Into ; -«*-JO»oiitia*liitiof o word. Language '' 
exhibit* euchOQOaeev <fin- ether otmm £*#.' flor b a n n o bl im doctor* ion, boo Frog 
(7)) that it would be -eurpri»l»g to f lad sxtraHMjenoe at the lexical level. 
Ratio, i t imm uM.Mtefg that ftftbMr^'^iiaysieaiib within • tenguago that are 
*Kjaa.te*sig ■m*to*tim** m-Jmw* aatr le^vta*** - $§jii# $iJMt - that ; booa 
virtual Ig J*Bto r i ib b igaab to «i th reeeect to denotation, fhe Irt^oronee' coaee, 
of cour«B, at *he ***«< of coontrtatluo, and It aeoab Wt this Wi It be the 
case for eH eynonyee. '-'^ :: ' : 

To gat thJa«neu connotation I nf creation into the oodel, ue introduce 
another sort of -fitM. fhfe fle»d take* synongeous uerets and orders theo 
along |x*eaartl*e Jibe fordaHty, '*acte***»lityv- ate. $» a%tf*ttiena Jn the 
eystea 4tso axi%'MNohf ambdMla*,' ^.eV^foraai, "t^a*ra#6ilV'-ate'» In 
another eyeiee that aoaie with a wider context and hs« e^rieher vocabulary, I 
ouopoot that <hto would 4*ont greotor eMbrssel'Ve froaaoor -than is given by binary 
categories. 
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3.8 Choosing Conwpt ft«rlwii. 

In ths chapter so far, ws havs rtl«^ip^ih« ; MP^(^^«Mrk»ri should 
be ordarsd, but there hat been no lndic*|Jej» Q^JfM they §£pJd ^f ^P**^ In 
fact, it is not at all clear Hhal a jmt ^ajB||a|itfc^|N)^^g,a^lj|I.Jo9k like, 
because theu.are used in the suataa ^j^^p^pm- HQP&Qkt. Struc^uree 
buU t fro* con^t jMrkers are of cenirjM J*£$f$m$$Jp Jfca *#**•## »° 
although tba fol lowing discuMJ on gsts ahead of <^ jge^ll lon, b J h think it is 
important to • top and consider »os« pf the Issue* $fQpifH$> *)&.&?*&)&*■■■*&!**** 
markers. 

For each oan tones, the semantic component .^ly,, coipitri|Bt A sa*antic 
representation. This h\ II be bul It froa,.oonca|ii^f|a;kjf % ayoRjJail by Hord 
definitions and the semantic special la*. rou|inea ,/t^f«^»^^ f ^.«.J«*P* nt,c 
rofrasantation.bat.. bean poMtr^tod. f^,4j|p|aiw« f! ^^j^^nmt concept 
markers Hill he ussd to call o)ioucti^r^ina^a«|ip||^tej hX*$ ^hem. It is 
the job of the deduct ive coaponsnt to pick ths aost Wfr^f^Jfllfnf&y^,** 
ship it oyer to tha generator. The jjene^atpr,. In tyf} : . m\}1 {.&$ [the eemantlc 
representation to produce Engl ish, and, for thja,. .tbj^MshJlfitionaru. »■ 
ordsred so that word* are essecj^tsd #^g|lPg!I^B .^ kers or. trpupa of 
•••antic markers. The concept Barkers,, J^eii, j?aye ,a fourf oi^ rolai in epurce 
language definitions, in t«rQ«t languao* def inltlcms. In the seaantic 
representation, and to ths dsductiysjai* baae. fAxJ^sar look at a fifth 
rols, that of sslfction rsstrictlonf, |s deferred vsjUl the, next section.) 

Given the various roles played by tb| assjantlce^ us consider 

the choice of a dlctJonaru defin^tlon^|ot: |he : jGeipan, f! yjse^ 
break"), as used in the example: 

Fritz ta^p^ daa Feostsr. / Fritz bjejoke tba «lndou» 
First, ue could uss a apecial semantic mar kjsr e|^AK with a standard 
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definition procedure for tm*ti&m<Wm*mim-*.2Al*- Thi« definition ui 1 1 

m co*a*et?; mm&w&fc*:' a&itimfr'telbfMm'&i'lmik- t3r tn* other hand, 
thr aarKat- HWCAfcM* * r»Wwr 'to^te^eiteV *af nft-nMI#ft dhooW i'ne-feaft' to 
wrl t% «*•*• d*f1i#«*iB*> W : #. '4NHHtl e fttMtyr- irt^Wj^ NitfeVaf t#ouiitf a chunk of 

mm? rnrnte mm\9#m\9>mr>*> ttow )um? #«jiiWftlir't*1fti*^ryi*i*1^i eat 
var«u8 oa*» of «MpW*»*+WT. / Hear o ewta^un uetty »eWJMl iyxtr«yfu ''; 
lan gwy tt#p*nU*w^ ' t»<< !•» of dUtf^oM» wof^ t hr ji< i» f>f , «ntf p»rftotilTty to 
when tar a#^dairt'4iaj|t»*tfc» two- •aoerelta-- languagee l« avolne/le ayetetrv * tfciP 
generate tafcwr t*» ii ta wrti e i»>j i e i e mif low t» M ful ^unaaol guaua, and 
oxpecta tlr tajt-eefee: ftp genera** vhfe l¥ ttftiM* 4 fyi i ih#'''d»41li K W : ' l ^-iil«feti ve 
coepohent (e«c«p¥ Mtevitiht ■peel el oaaw t*»t*» iriW d* d retutaad) . TnV , 
segeent* or tfw- n^K f y I 'B^ w a i Wat ToffTKwr nwef fr ^ B* slepta (tou-tavai ) 
anougtf bo tfHrfs nW l liWW lW K. HiiM^-i«lh WW lia a tm c f*»proeemat*Oh 
rMfam -'«He*geiaet#ei*;. v "'"* r -;-* .■-■•^■■r .-»**♦*.- *,v -'•■■...-•■ a ----- 

Oi^iritiiflti<<r MlttM^ aifttr. beetle*' tt» wittirtl c- r«pr—#n t»t 1 on offly 
f #■'* dttJuctt •#&* 'l#e*ajk-t>feiw*#*-©&n^^ uiwpoiiwit u and be 

unatfaatwtMy abtaNaalNa^aai^^i^ itoVto*> eh^Belna 

bwtween^r^aaeft*^ He own, eueh lower-l '" 



r epr«»>ntaf t on to b« pl w wea on to ttw- qany tor . . r thW^"lW^iWif' i5W *'' 
r » |j p o >Bnt ttt1 on mwaUd- try to appro* I iff f an g aaytMn a ap i no e HUB , ? I n tn« eihaa 
that It uou> data to b» Input for a ya>W ' <tw bf any lartjutoa. ' 

The* ap » ir uath rttatcnod aaaaa I ike a good >a<ut ton, ' In that •eurco language 
d) oil owai^a*fl«m*»a? *c»ajtia*t»- coapeet end u«1 t#if 'Art t»*iaf * til* seea t 1 m no 
infomatlon wouJd ba lott on tho way to tha gtnerator. :l l ; 'ffiii: ne* a ehoiiir 1 1 
for t«a raaaortB; fMret", a »ou rewtf r i>» o iintat lon fatawia iw^it laaa ui th 
targat ftawajtfe* do*t*t%lo*ie, lift*, o low* lata* eeeenrtc rap iraaameflort. a 
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large number of target language delation* j*eu*d involve aeeociating piecee 
of the eeaantic repreeentaHon, in«|ead el ju§t#,e|^|||b|iarKer, »l*h uorde. 
This is a pattern aatchingpreoieethaU^ of uorda. could 

elou down the gei»eratipp*proeesa intolerably 4at s taaaf u gi^,,qjpent aeaory 
architecture), In addition, rate that ti|s^ Y ««KEN1) 

representation of fere no clear advantage* for orga#|^n^.the i dedyc^va data 
base. The ke»?nei of a|*nlr* r aaeec)ftfad^ ver^lr©* pnp 

situation to another, but, 9» Vm par Aicieanta change, the nature ; pfj*ha 
causal relation eeu. e» **ei»« for mm^\^ t*ym^ ^ms/K^^^^^^^' fM 
thaairror*' with -$m~ cawsaHtu. involve* ia £&r*Jn iajpaeoVJhe peesjprn." The 
actions associated with causing breakage are hitting, dropping, etc. To cause 
corn to pop, on the other hand, >to^m***Ml$^G9m»?&'^ ■*#*-. •*■* **•* 
eourca* Whet I a* tryifig, *o gajl at hers is tha£^M J «**-!•«•' 
repreeentation^ j^ \^^kdi^m,f^^h¥^^>lf^.'^ "• 

associated with coeb in* t i ons orf concepts, i.e. £*|fher : leW 1 *** 

Because of the conizations aenWpned^ jn,#e*^e^| eepentip Pfrkere 
are choeen ae, thp union pf $J|ar^t^d,j|p^^ cap-.tiaye 

two different lexical rapra*pnMjyona .It^Jpojnmn p&J* Englieh /e^cluaiva of 
connotation diiferateaa* that <.ipl )ff ,-lha louar t iaip^ T |se|^#eenta^ipn Ja chosen. 
For exaapj p, BaVXaW ** IlllMWr i" to hnouU «lnd,up *Af h the tHP eepan£l^ 
aarkere ,.mfH*>f#&: **» IWfl^^RSPNr ,#hj la«*i* #Pd *&& (Jq6er«arH , 
Uind ) get the tuo Barkers #GALE and (BUND. With Ihjys Upr^pi prganlaitJon, 
the deductive coeflo>Ma>tJagVveij adde to 

the semantic reprssentat Ion it is ^«en by, tiding ^j^a^^o^, J*fj%,ppen 
by the eaaant ic epeciel i et robins*, Iha. teaant ic njpprsspn|at lone, generated 
frop-thesa eaaajiftipaarkers have nppieU|^t#y : fp, J anguege I n p ^pp jj d pnna . but 
the toss in generality la cpppeneetad f^^jfa^ter,,|pna^tlpn.= 



. Ufton u»**^ c»e«e»t- «©r*»r» to H to i operant to 

r oooop o r.- thui^ ttwt^ g* v offi« paH iM to for w t lom ' !**■«••*; to- o^oyotops where 
■ul M -category d»«i^1ton» «od re»tf i S^to»i»^»»'«wt»tv th» > ■ — rrt te oorkor 

rOO^t^tfe^OnO-V OJN^OJPfWOM yj^ nO#8lhdiOjt* s QBW"J|^i , M^lBW9B^'f*fV' tMMfcf*0)ofeOii|(MByi,'. » •'> 

off IvUMJ^ a* di m omo* » ■» I )«r 1 f MJ» (it l oot oiri .-' fth a th»r e^trogBofr' of f fco 
concept wpke*' reo^.4o«o^^^ tOMgwt *'?haocM#'00 

»ret8O04cd i f> • t«Hi»' wwi ooip*t*- M>»0» r— ^moN<I himianyi , tMtctiw^ 
peetrte*io*»i then* took M** ■ proo4»*ftfr uo» to iim 0Mj0irt*co t*^t***«Hn« : 
parso* 

Hits f trot vere4ae^*f~ M0 oyete* ^ irtfor»#Mon 

to Quid* tf* per*e*.aMheui*o«*y too*v^ %lo m«* H* the 

rout i n* t!*** p o fo o o ver b ofrjortsr i f the- ver* »^ bee** tounoV- ft* rejrtHct i one 
were ch«*»d to »•• 1 f «»^ irwludt* »r loc«t4<»* o OO HnOf » or too* e*her- ' 
property. !f> thto- moo the c— o, p rojw tHtoi^ phMHN* end adverb* Mero 
checked to ooho »«>o that et I — » t orm of tho— ft t tbfr ttobi ft t c roqu t rowan t . 
If this u*r« not t*e co»Oi thw^ «|I•o^•^•c^<>ff COtHo too- takwr, Uko checking 
other tyn*oct<o rimoenanU to poo I f they hod Obeoroode propooi t Ion by ' : 
■tetiMM-. ^ TW**eW o#* oouroe, ontg H»V1od Aio»of ■ i—n< IOo to gut do the 
porso, hu*« i t woo iff aft oroo whoi^ o t^t of hot|i to obv4ooo<tf n > »dM> » neeely 
bind+ng of adwttrfetoi oodffiore. 

In tho w»t rooont voroion of th« »o»*«»* thespe^ee^ooea^het'-uee 
selection ?re*i#is*1*f*' th ^i* h^ ^^ oW by 

Qarew ue^?oMbjito;<. '^ f^lnyHeh ; :cHKiiO' 

9*t*x&Mmm- tho oo4n v»rt» pf > » c oo>» 1 1« ob>oot». Uo con t horo f or o ofccooo tho 
voi^ f «*rootPiotloB»^abo v, i»fr''th«o tO: ; 1ool^foi*' ; op'OWtt«i^ ; o¥|icttw ■ in Gerean, 
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however, only aoee Main clauses have a subjsct-verb-objects word order* whi I e 
secondary clauses are usually ordered subjsct-objecte-verb. In addition, any 
Main clause with an auxiliary, Modal, or passive verb structure Mill alao have 
ite Main verb at the end. Since uaing selection restrictions to guide the 
parse requires a certain aamunt of structurs (to handle Multiple definitione, 
optional objects, and variationa in word order), the invoetMent in progressing 
effort eeeMed to proeise lees return for GerMan than it does for English. 
Selection restrictions are, of course, still part of the systeM, but they are 
used exclusively by ths sseantic coMponent to sliainats iMpossibls ssMantic 
representations. 

In the current iepleMentation, sseantic restrictions »• hung on the 
concept Markers as LISP properties. Since the Markers have an explicit tree 
ordering, the restrictions need not be associated with each ssMantic Marker, 
but May instead be tecked onto the highest node for Mhich the restrictions 
hold. This saves space, although of courss at the expenee of the saall aMOunt 
of ties it May take to trace up the tree to fetch restriction lists. 

In this chapter we have Made boms decisions about ths ordering of concept 
Markers. First, concept Markers were divided into objects, relatione, and 
properties. Two Main order ings were presented! a tree to iMpieaent 
select ional restrictions and a lattice as a pri Mary ordering for deduction. A 
eecondary ordering uas provided by fields, to relate concept Marker e to each 
other along a di Mens ion. words wsrs ordered by virtue of their association 
with concept Markers and according to their connotations. The static, 
etrictly hierarchic concept Marker ordering proposed here would not be 
adequate for a working translation systeat however, the conceptual structure 
is now well enough specified that we can go on to describe the eeawntic 
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component. 
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Chapter 4 -- Seesntie Processing 

4.1 An Overview ,,. 
4.1,1 The Seaantic. Coaponent 

In the last chapter .i* :; cofli£ee?ea%%.ata^jU^a^^ but we 

have npt yet disqutsed hou a particular sentence re I Stst to this general 
freaeworK. In this cj^antfr and the ..neat* ^UUg$i|^.fejaBe^fea,t eituation. 

As epon as the Gereen graaaar has succsestHi^JMr *ad eea* aection of the 
sentence, the seaantic routines are called ^ .. TJislr Job^Uta cpnetruct a 
seaantic representation for each possible Interpretation of the lentenee* In 
aeny places, the sJsape pr^ tha |>area 

tree, but at other points, -the fj^ge^^ll I t Ja>^||p^, : fa>ere the par ee 
tree is a record qf syntactiq^eiaJUqw,, ths iwsentic reprseentation la an 
independent structure to rfjcprd Bseantlc ,cajya,ti^»B^,t|aJt,ar t e i bo«> ieplicit and 
exp licit in a sentence. The fej Lghl y ,etrj^JUu£ga] s s s j pttj ^q represent at ion 
reflects eyeteeetlc linguistic pbanpjiee* #oa\ t i | U ; 4rff^r^<:^Hj|||,froji the 
syntactic represent it Ion to ths bocly oMnferMt ion U*t would be Invoked by 
the deductive coaponent. 

The general organization of the seaantic c o apone n t follows Uinograd'a 
systea, although there have been soae fairly high level changes. In the 
translation systea, the seaantic representation J taaf* play* a proainent role. 
Uhereaf in ^t?QUJ tha representst ion is essentia My an interaediate step in 
the process of building theoreas for deduction, bars LVjh**fceA iaportant aa 
the input to the generator. To use ths ssaantic rsaneaentaiion; in this way, I 
have aade a nuaber of additions in terns of the infereetion it contains, 
especially in the direction of a aors systsaatic traataant of thaeetJe 
features (section 4.9). 



*8Pt fioesie**, *erd <tefiftH+»n tgpa* 1w* been »*amlardiz«d, so that the 
individeal word prtome d w o » oaw Oo «— d aoocript 1 w*y e» «otl «• taparattva-ly. 
That i«, the dofif*tt*n» ofrrM *i<ea4nid-'*u tho uimiiH c eooponent, which can 
then take «ny a aa rup i tot« ecOorw ***** 0*000* t*B ^ ! j*riibea\ir<o'. A aechenl m 
*ao ***©<•*•* 'MM««MAH#ttW iMdnwrtan ^ «n »r «M- j% fadhien (the 
ttJNKMC nerfeerj <eee est* Hm 4.7.1*, - Tnii Qoaoiflli <iaaj«iiliini -«m Acid a 
TtfMfeMh «*• 4Ne i <poon>l *t»w oefot* «M **% ifldWuU <*re bound, and then 
return • to add tho «rij*eertt« of "th« pe*»tfo* *t thou «r» eneouhterod. 

^io chapter **#t w two pent**** * dree* fleet «* dotal I , but 1 1 actual tg 
dowtt titft twgin to «**e*»t the iaotiM dlotuiMd. feefciwn 4.2 deals wl th the 
way worO *tfffr>ltl»wt «r* eeed to ouiii W* MOontlt reprooofttotlon, and 
eoet*H» 4*4 ri> wffdwoe» *ther contntnrth»« tb the ropr»»*ntot Ion. The actual 
t itforaartSon 'fry tt» ^roe ow ia tion ii oaoiloHaod in iocrtion Hf& Ute^t ell Horde 
cwitrtbuto to 1*0 toeaentlx f-otr»*o»tt^^ arid this Is considered in 

section -4.*. '*Men4.7 dfMnmwi^k fhet dauee Opeclal actions in tho 
eyo$e», Kh-i l« «ei*tan» 4,6 iflfl^ handle tho r ey sswU ot'lon o* tdioas and 
^orofononco, rto po tH H^tf. £to»| fg, *edtlen 4.% dleOUoosB tho roprooontat Ion 
of thtMMt ic irtforoartlon, .and section 4.<t§ treats seaamlt case. ;■' 

• x«dC rfw TOB^PrwVvrTCafT ton 

- Tne o^wrt^ Poor«»en*»*ron h« oo^tructod fn» thr<e aorta of ' 
oowporwrtrw oa^ct, ro+oH»n, s»»J |H-operhj liWitlt -Structures Tf088, R9S, and 
KB8, roopscttvoW- Thwe ou o owwriU org PuwtH«« of frifprost ran, and their 
4 *r*eos -tufssjaaa <h*tet<re^He««hip* *»WWi« -a nntonco. *h ^general , a eoaent i c 
r**me«en*»«i«M m wmvmof* *<atheF than « i 4MoJaL : ' ' % WfMrlift i a Hng between 
two sorts »f -U tSteas , ^owsver, *m can »Imou» HikJ en utk*^ lying trso structure 
in a wal l-forssd sentence. A eaapls repre s enta tion W shewn tn Appendix 0, 
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and I wi 1 1 spend moot of this chapjar dl scum If* Why j| Joaka tli>««JJ *>••• 
The d i ayraas that u i 1 1 appear be Jom af a, je^uay y flfOTt**-!! ^S pf i,m 
sapantip raRrPitntaWon.AlfiPa the ow|pj| jB^.,^aaa^^j^|f^|ia|it contain, 
mora information thanja sbota> „ ? Jn tha jia^r|^|fr#|^^JnlJ0r|«t1on ia^bung 
on the L JSP property \\%i.p%.m^^j^^.^^jg^ff^^,^€t^p^ : ,, 

A question should be . fjpl apijl^jjfiriajtyvpi ^p| #^*9pM#M>fliM^' •- f^VRf**^ ■ -.- - - ■ ■ - 
representation shpujd rae^asa»^~J|teila^j^ the 

semantic component has been di»cu«sd In .apra^jleje^,, la.| a*. Jmjs| : distinguish 
two levels of eeaantic/Jnfpr^ |ha|pj |o, : ; ^ 

I aval a, BasiefUy thapf^sitjena^Js^eJ ^Jt^M^M^h \ 8 -r.JfJ d *r. **• 
theaatic level to the way it is. said. . The, ,$! IB ^*VM^^'MMl^ t &M? tlon * 
I ike Mhat Inforeatlon it i«portant in a sentence, jwhat.Jha »peaKsr wishes to 
convey, , and the. aaeuapt jona he r hae fi abou| what Jt| : .|J i|j^ej^ & kno^§ f , - ThP«. 
aeaantic representation used hers is ApteiurjLP* jroooait4oipl fi a^, : tha|Mitic 
information, and I nil I coae |^,to th^jp^^oil 0^ Mft^* ■ tJB #P , ^$ 
representation should look like below. 

4.1.3 Bui I ding the Semantic Rapratsntat ion 

The Boaantlc component, like J^l ts Jaa..^irBa partti , ti jtha, semantic 
specialist routlnei (SflSPEP, the ssaantic utility routines (SHITJU., and the 
dictionary definitions. The acjual, bui^lng,^ .tJ^jpjpipmaijta^ajKja dona by 
the SMUTIL routines, and for much ...pi, the, semantic repreeenteJt Ion 1J is the 
dictionary deli nljU one which pake the caj 1 1,40, fiTUTIU ^The dictionary f 
def ini t ions, in Jurry are unleashed by 9$PEp atiar 4 i ha* sat ,^14, . *ha 
necessary cat I ing parameter*. It i* the SttSPSC £pMtynej^4bat: aya^pctuaJ Jy 
called by the oraaaar, and aany pf th^se^^roi^i nea qprnyt^^yOrsmrt**** 
const ituentsj ttftre^aff sTtlJIUJE,..:?^^ the 
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other hand, hare saris* of ssaantlc lejstfaflsts, SffCl through SfWG3. For a 
siaple nm* group, Wltti 4« caltW « sew a* the sain noun has toesn found, 
and it wilrttM sny Brenoeinsl e*tf?n«t^ »t*rttng wVth thos* elo»s»t to the 
noufu Control than r»ttrmi to th« syntactic iiasjunom to paris'aHy 
eu*H**or* t and «fcen 9W62 1 inks tftss* ouaflfTers t» dM's1ttfc» SHE it also 
responei*j*» far *¥*fti»t1h| rstttionei t*um, uMth ST* not touched until the 
entire 'noun group has been parsstf. 9»*B3 cheek* far rtfartnc* to "einsr parte 
of the te*t if tm roun gr^Mo 1» *»^iwH«. A»ao part of tha noun group 
package is SrfctWttlf), uM<^ handles cea|>eu»d noun* that ere not In the 
dictionary bur uhbee coepsnsnt aTaia«^'ir«l ? "f^4|r1% fefftth* and SHPR0N2 
handle noun group* tlart sre pronouns. -''.''■"■ :-^ « $-^ -■ ■ 

Mote that there Is no separate Me a nt is specialist for verb*. Al I the 
action* nscoesarg 4fr vsrb* ar« tk>ne 1^ tht •rnoV« SIWJWSE routin.. By the 
t1*e STCUHK is tssilee, %he itfbjset, verb object*, and eodlfler* have al I 
been parted. Ohio it true en o ug h m far at "it goes, hut not coaptetely true 
- eee section 4.7.1.) SttCLMBE tolnds fhe rslstlen ■pecinsd by th* ''verb' to 
its participant *aub£ect + verb otojsct*), then binds ths sodif isrs to this 
relation. This whole procsss la rtfcndr Jls%era1#Wpf&#cb," and the 
disoussioh later on uflfshsd seas *lf>f on ths hinds of i nf ores t Ion SMCLAUSE 
.has to process. 

Uherover ths seasntic spselsiists are c«11*d 1n ths parss, their general 
role is to" -be yea- tr r>au- m^^«^ Whenever a •saantic rslstlon, say the 
definition of a verb* Is bound, cheeks aWaad* using ths wTsction 
ree*r4c%t«n* f *os^**sd^ln J W If at sny ties no - 

represehtiitien can be Uui If for tht section psrssd. fjnsn seaant hss returne 
f eifure *o eynta*. In the systsa as 1 1 'stands nou, ftsesnt I c« rm*t touch* • 
the paroe treosMcspt to get inferaatlen fro* It, s I though the uhiwpleeantsd 
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ideas in section 2.7,1 would involve a sore active ee^jaf^Mwh 

4.2 Lexical Seaantic Structures ^ 

4.2.1 The Standard Definitions 

To get a better idea of Jneuay a ssjpeji|i£ re|^ 

let us take a bottoa-up approach and star* u^|b jMes^ea_ ; oJf tf $!e three a^andard 

definition types. 

(DEFD ELEOONE 
SEMANTICS ( 

(NOUN (OBJECT CONCEPT* MELEOONE 

CONNOTATIQNSt 4SCIENTIFIC 
LABEL: Dl)))) 
(OEFD FOtG 
SEMANTICS ( 

(VERB DELATION 

CONCEPT: JKGO-BEFORE-IN-TIfE 
TYPE: NONE OROERt LEXPAS8 
ARGS: 2 LABEL: 02 )))) 
(DEFD BLAU 

SEMANTICS ( 

(Attl (PfWPcHTY CBNEEPTI MM 
CONNOTAUQJIr +C0rip. 

LABtti otfrn ^ 

The seaantic definitions here have the folloulng parts: 
The Selector: 

The first entry in any seaantic definition i s a s^|act|^ feature, which need 
not noceesarily be the part of epeech, a^ dif^no^ehlpjefture ^M do. 
The syntactic features that Mere chosen Jn. ths .jou^se t p|^hj^r eo, . caw : thue be 
used to eiiainate seaantic possibi I Ities bj aftc^ira.aajlnet the firet entry 
in each seaantic definition. For exsspls, |f s^^pp^haf been j>ereed ,ee a 
noun, there is no need to consider its aeanino|^py| yf^b. If acre, than a 
single feature is needed to dlscrieinate bstwsen oaf Init loos, than a Hat of 
features eay be used. In addition, if acre than one syntactic f seturs (let 
takes a particular seaantic definition, then a I let of distinguishing featuree 
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prefixed by EITHER a«gB# a*ad. ' 

The Routine Naaar 

OBJECT. RELATION , and PROPERTY roatthe* ar* part of SfWTn., and bui Id 0S§, 

M- «*# P» iniiajtn***, r*aaeeft«#t* • *mtw* *t ** murmti** in the 

ddffnfWaw »^H*e perePetera for ***** ftRJttn^ feftPM ^ th* foliating 

keywords* 

CONGEPTt ' 

This is the a »a«nt tc edrttee. ueed h» eui fdfog up the eeaentie r pre wnta 1 1 on. 

1 1 i ■ a part of the concept •tructoi** dltou i m; t« «e> prawiou* chapter. 

TWEt (re(«th»« *»fy) 

This epeeif re* *h» Mftff left e# Mm ■ wrte oi ■ B «aa aU to tht eeaentie onee. 
Type* are ONE, Tt©, THREE, NONE, TiBfcTflflBI eta., * tfMy tal I which of the 
tenant re arguwents »#y b« I aft underetood In the atorfeee reprceantat ion. For 
exaapfe, mc way e*peef cPr'tPih ra*atJ#i#' & have an ' rnetrueent opacified at 
the aaaarvtte tevet* Tti* relation Jt£}?* than. Mould haw three ''arguaentai 
actor, pat Tent am* mati*u<wnt. In tha aantanea *Karl •tehnttt dee Muret ait 
e!ne* Hotter* llferi eat tha eaaaage »*<n efcnife*), the definition ueed for 
the vor© •ehntidmn would hava type NONE, t*net no arouaecrte are I aft 
undarstootf. Irt **ar-| etnnt-tt de» tturgr (*tart eat tha eaueeaVr. * 
daflfVl'fnaii if li|i^#M «»W at deed* tinea tha third arouoent - tha 
ihetrueent - )• Wft understood. Slellarty, ^ih iteeeer sennit t daa Uurat" 
<*A knif# cut tha tttuaaae") new* a at? WfBen af t«pa ONE, iTnca tha actor ie 

left urwterrtood. ■ '- 

. • '• ■■.-■■ .-.■■• . ): 
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ORDER: (relation* only) gs 

This is another way to •Mpply infor^Ae* fpr t |^tpW^j|f?,«^«#P#^«ff»H to 

semantic ones- Order is u»*d foe .ward Bijjra 1 1M rJttaH »** W«»> W _ in 

German yjy^afcaa and J&Lfll*K * "*!£ t«^f^lf t « J» •*»? * - tf^P £ •JflffM fj^nt J c 

marker, and to do this, ona of ^h« paAcjui* Mft»llfi*#l If^icfAM*. ^ftUvjl . 

(LEXACT) and the other a. ^iajilfMl^ll^W .Mfn^b M***** .W#P*»- 

as in the exaap I elation ..ajjove,,, ^^^uighfi^Jlh^fp^pM^J^ ,|fp^pl«al^pfa: «i^i«et 

(in an active/se^tofic^rJbeepjp*^ 

Tha dsciaion about which word is tha Aa»^l^j «£*»*•, ¥!• if** Mfclch the 

passive ona is arbitrary* , « : w 

ARGSt _ (relations only) . ^ ■*.■-...■-?.%* n 

Th i • i s redundant inf oraat ion, since w| can a I Hays r«jcj|pr, s; tip,. fiM»|Mr of 
arguments a relating JMwe given rtsjw & ^l|yfied^«^^ ■*•■:, 

associated reatrictions list). ARCS IJs wz\ HMpdJii V* ^|i^t i,p^ ;; ,arp^y . 
however, partly for efficiency reasons, and partly to help M keep track of 
things when writing the dictionary definitions.. ,. 3 ...,-■, 

C^|NNOTATI|0iMSz 

This holds the corwotatlon inforaation santipned in aectlon 3.7, Mhich Is 

expected to be in binary form (fffiaii^^lfiW^r^jff^f^v^ 6 *^- f^* 1 * 
now, this inforaetion is optional, Jnd }\ J*,m, W0 MkS$* ■H ,t r%*" l *M-- 
generation. . ,^, : .......... 

LABELj 

I f a mot d hps *pr» than one saaant i c del Ml) ,tJ aa> each J s g I ven !5 a i aba I . These 

are used for error, i»aaaj^* ; and- cr^aa^Mrtara^ 
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The syste* htf« a group of function, that phi* irtforeetion out of salient ic 
def rmtffenW'-. Ti . ti i e are used N* Hi to eat* aura <h* the right global 
varia»f#s ore ssf f* yarn** case* -' Wist fhjf%f of a%?1nl tfori Marches 
atitturtte to Fa *» aWNy to uw diflrtlfTwti <W#Cfii*t1v«l u *• uott es" ; 
}«t*raT?*My. - ^iV#-» rf« M Wfi *it^^ f#«^ o«W ^#^ffr««t?«r, sine, mm 
tnfbf^M1^-w^^«M^^ laH^^eo*** *br un by 

t*# 9W9*fie reuunee, ¥a *w ar ttatfia* aesuat •eearttje *itiMWt\m. The r ' 
ctorrttttarflMli^^ fhforaetlon. 

hoaaw, e*r tam i* h» tmt ^^inart»lj jtf^'^Md^mmtion*- both 

descriptively and procedurally. • •^" i -*' J :0 ' ;!A * ^'■"■■-^ 

Another tort- 'of inforeet ion the* should be in the definition., but is not 
non, in a mature of pfeutibH ittt. Tfrie eatd* tawf ^^or* W a rnieber that 

.pgBtfljHjr "irti- orb1laTI^'f-^ ; ««li:i- ; ii-J*^:%^ ; ff-e%a*i give* ^at" the* word 
occur.. Such pmm f rttf Mature, appear fn ttnograd* » suttee, but' were' not 
tifWiiar* because of tine Viettlftona." ^ '"'"'" * flv, - ;,: "***"■ ! ^^»-.--. 

4*2.2 Other Definition Type* 

In addition ta the three standard definition types, there are several 
other dictionary functions: mtlML, SHNOHINALIZE, and Wf€L-Wit. X SrtfclAL 
deffnttHwr tyi»d ;t 1¥-liaad-Yor ; r%ia€Tonf ^tnaVao not fW ok lading definition' 

nipei-* wsrmm iMfi^h^totWirMfUiiy' uiV it. 4 

Th1* aorfW fraedea WTetfng vdfueale for •WciptVonal caWs like ajji ("to 
be"), out it he* been uMd sparingJy efssuhere In the eye tee. Oftent a 
definition will start out as 8P6CIAL, but then another word like It uill toe. 
along, and eventually tte haw a class, warrant ing its own function type. 

mmnUNLim is e^ooffnitidw %& am gficf ft Handle ranHhi f t. Just as 
urn ha*o clause* aervVnr a* ranmtf tetf 5 ^^ or 



■■■■\ : ^!%$^^0$& 
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hier let." / "I don't knew why he Is here."), :jt|eee .fee ei to relations that 
«aM,be represented by nouns CQn iiifcafl f*Jlt ih« >»«»iM»r." / ^H« h«« troubl* 
flliMtiaa. ") III both CarHn end Engli*. anu *er» Infinitive aey be • 
rankshifted noun, and thara ara alto yordt like "dar Aufetand" ("tha 
uprising") whote Maantic d«f InHiont ara Da*i«aWy relations rathar than 
pbjacte. In the seeentic reprswa^on we want these, te appear ae relational 
but, si net tha syntax it a noun group, ppaeUI eetMwt have to be takan f ffor 

example, to hand la tiaa). SMNOniNAUZE, tosnr la reoponejteht jfor, «mmm thia 

k, ■ ' " 

action directly api also acta as a tianal ta athar , mm h a m lH at rent km that 

apacial treataant is in order. 

SITCL-PART hand I as nouns that naaa a participant in a rs I at ion. For 

example, tha word Zajfibjp, ("4**.te«t N»") rjeanrba dsf Mwjd ae "aoaa thing ihat la 

acting as tha f irjit arguaent ,fef tha relation #W(W GATE. VJhe SWEL-PAPJ for 

£aj£bj(a, than, Jiu L| da up a place of rso ra to a tatlan th»t eeu * 1 *aaW be> 

rapraaantad by Fig. 4.1. 



res 

•INDICATE 



HEAP 



L 



tCTA 
eSOHETHING 



HETA 
MJNSPEG 

■■;■ W.f w W-Wt" ft? 1 



. - .-. Figure 4. l-.«-,. 5! -. ■ 
The flSOftTJUi^ and WMEPE C hr » ar« worked ffZTA bananas they ara a*ta- ; . 
concepts, °T« concept *ar lab lea, Tha XMWEC Barker 4ifpr> "unapmJ f imfi .^aee 
section 4.7.2 below) would probably ix>* be ueed bepe, jutace, the; ^oroa)tfcon 
would be given soaewhere in the noun group, as it is in our saaple text ("Ein 
deutlich sichtbares Zaichen fit die. ... EjxtBMDflia" I "A clearly viaible 



indict t«« fl f tto Jtmtt^.M fc*h tfto-OflCb-flftf ontf tt» STMOnilMLIZE 
root < i»— ar« «t t rout f oo c l w oo tftoy,of fo> «towu M Ui m do not nwrt a »«pOrat« 
conceit *arHr #0* r« !•*»*« or r«*otl Oft port ttrtpo** **##ti« 00 "0*-o*f00*. 



' **3 I n t m m i \ m **t ttm »0mo»» Hr WrwcHrw - - 
Uhon tho Oo*Jr>m«n t|»>t ■UiK' t ar > •ngB Uf^ wintt wnt lc cowponont» 
tho* --«intrt#'4lllfHM4WM!Mt < MrifMittMftr ' r ' 



Tm< t» two nOoo now o rotod for tho ■>■■!■! tte nodoi no tttttMdi MM* trm- it Ifco. 

# wMBfcr 4« IHtpwl to an Iftd I v idw*» htotont lO**on •« ; on object, rol ot 1 on, 
or **0poHO> - ■*»>. n oO i w olw r o»ow Ol f th O ^ t t ff 1*0 uowO o of buHdlrto i 
roprot ontott on Jar » oortloolo* O»rtoetft o fruchi f 0, ^Mf> l Dw voriobto ro«Oln» 
tho 



CONCEPT- 

Thio eoiwto fro* tho coll in* p*r«wrt«r«, amf^t to our old friomd tho oooontic 

oorkor* 

RESTRICTIONS- (PS8 on« H88 only) 

Sol oction rootrtet^ono or* hung •#oo«-*<# ooneopiuof otructtro* bo tng 

aooo«^o*o#i»4^'»oOi^*ortO)»'. ; l%»o**oiii i^aoHy or v ilii\i'0i^ thorOorf, Qnco tho 
oo ooWM c utility piiitf too hov* roir4»vod obf ' WF tytotottftihof ' thoy 'ir# fcoet 
on tho «oooht1c -woov fOf lotor rofO f o no o. 
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LINKAGE- (RSS only) 

This is a Hit of tha participant* in ■ relation, and it represents half of 

tha explicit linkege that holds tha aaaantlc rapraaantation together. 

HOOIFIERS- 

Another Mechanise for I inking, this thsaatlc .faajure binds GSS* RSS and PSS 
coMponente to their nodi tying rotations. Whlla LINKAGE values- produce a tree 
structure, when values froa MODIFIERS ara added in Ha can gat a general 
network structure. A seaantic node with Modifying relatione is called the 
"head" of these rflatlone. 

CASE- 
CASE is set to a concept «arker that Is .fflund abjur* *** CjJNfEPT on the 
selection restriction tree and distinguished by » IfisLfM tag. ThJ« i« 
redundant inforaation, since the cess is a I Mage derivable froa the concept 
Marker. It is useful inforaation to have around, however, especia*4y for tha 
deductive coMponent, CASE is set only for PSf^fad fJSS's that act as 
aodifisrs,, that is, those which .are ..conpcte^ to 41** semantic representation 
by riOOIFIERS I i nkage. Saaple casss are SPATIAL, TjppRAL^ and NNPfB* The 
use of the tera "case" for this seaantic feature say be Misleading, and it la 
discussed in More detoit in ssction 4.18. 

TYPE- (RSS only) 

This is the TYPE inforaation supplisd by the dictionary routine*. It is not 
really ssaantic level inforaation, bu* U is put* inip the rapraaantation 
anyway for efficiency reasons. 



8ft 

ORDER- (RSS only) 
Soo TYPE. 

REFERENCE-SCQPE- 

Thie is sat to aitfear GENERIC or PARTICULAR, depandina on Mhataar tha 
Information o4 van 1« about • particular atojaei, : ta4*t^Wi^#ii»ttJh' or about 
tha efaat thoraof. 

GIVUM®*- 

Thta ia sat to GIVE* or NEK* Sm taction 4.9.2 for an axplanat ion. 

C0REF- 

fhit givat a I tat of warn )c rtructura* that ara «rafap«ot tilth thla ona. 

It is diaeuaaad ftrthar lr» taction 4«ft; 

INFO-ORBER* 

This ia aat to aithar UfWKED or to a lift of t«o notflfyrna ralatioho in a 

elaoaa in tha erdar that thay appaarad fh ^w aurf lea ilri&tura. It ia 
ditcussad furthsr in sactlon 4.9;3. - ;?; 

CONNOTATIONS- 

Thia it the connotation Inforaatton froa tha dictionary definition. 

THEME- 

This Is aat to tna tamntie noda that c or r aapand i to tha thbfta of tfta clauaa. 

Sm notion 4.9.3 for an explanation. 



»*«e«^f|g««»S^f*a»s^ -•*:■ 
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RESTRICT-OESCRIBE- 

This is set to either RESTRICT or DESCRIBE,, Sse section 4.9,3 lor further 

discussion. 

CLAUSE-TYPE- 

This is set to COMMAND, QUESTION, STATEMENT, or SUBORDINATE. See section 

4.9.3 for on explanation. 

PARSENOOE- 

This is set to the parse-node that supplied the concept, if there ie one. 

PARALLELS? 

This is used in representing a variety of coreference. See section 4.8 for 

detai is. 

4.4 Non-Lexical Entries in the Seeantic Representation 
The uord definitions discussed above for* an iaportant part of the 
seeantic representation. Not all entries in the repressntation, houever, are 
formed by words - soee entries represent relations ipplicit in the syntax of 
the sentence. Soee examples of this ui II be discussed here - e.g. the 
poetnoninal genitive, adjectives, that sodlfy nouns, and coepound nouns. These 
are closely-related cases and by no eeans exheusjtivs, but they give a 
repressntativs indication of the iwues involvse\ Uherever there is an 
implicit relation, it is the SMSPEC root 1 net that supply it and «ake the cell 
to SnUTIL. 
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4.4.1 The Genitive 

Starting off »»i «i an exaapie, the wasntic representation for tho 
genitive construction "dor jtegsnschira dor Baas" ('the lady's umbrella, 
literally, "tho umbrella of the lady") eight look like Figure 4.2. 





BBS,. . ,. 














MEMO 
















OSS 
#LADY 




90S 



For "das Auge dee 
4.3. 



Figure 4.2 
("the age of the lometer"! m aight have Figun 





RSS 
JH4VE-AS-PART 














mm 


















OSS 

*EYE 


■'■* 







Figure 4.3 
In the diagraae, "head" is used to indicate that the linking of the two 0SS*e 
is done using the* *ep^ iter 't*30iFtiil; ratl^liatfthi Efl»t/^'>e#1atar. In 
the tuo examples, the OSS* s ore foraed in response to the nouns in the 
phrases, but the H® reflects an i»pl ictt relation. (Actions taken for the 
determiners have been left out of this initial pass for the sake of 
simplicity.) The two phrases give no explicit clues to guide the choice 
betueen #QUN and #«\VE^AS-PART as interpretations. Uhat's acre, there ere a 
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lumber of other possible relations that aay be ieplied by the genitive, e.g. t 

der Geruch das Kleee - the saell of the cheese 
aspect of a thing + thing 

das Buch das berubaten Poeten ■ the faaous poet's book 
creation + creator 

der Stadt eeiner Geburt - the city of ay birth 



V • 



aspect of a relation + relational noun 



das Geschichte eeines Lebens ■ the story of ay life 
account + subject eat tar 

There arm Many such relations that can be expressed using the genitive, but 
the possibilities here are not coapletely open. For example, 
das Pflanze seines Schreibtischs / the plant of ay desk 
cannot be construed to aefn the plant tt»#t» is on ay desk. To say thie, both 
German and English use a proposition teuf* on) to explicitly eMpress the 
spatial relationship. Thus, if the number~of relations that are iaplicit in 
the genitive is bou nde d , as I believe to peH^caaa^^fJffcM sense to talk 
about producing semantic representations *or %** e^ Wea-ent possibi I itiee. Some 
of the possibilities are in fact constructed by the syetea, using selection 
restrictions as a' filter to block the blatantly iapoeeible combinations. The 
next step is to take a closer look at these semantic representations using the 
deductive coaponentj but 1st us first finish up the discussion of how to build 
semantic representations before going into the question of hott to chooae 
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between ffMMK. 



4.4.2 Noun Modification 

Another exaapia Of • aeaantic relation thit Is itojtllsd by a grammatical 
structure is *H*f&ftfP@m. Any |*ato»rty used ta IN*** «n object <tor • 
relation or another {Hearty, for that fttttarl »» «!*** » #W«^fBP0rrV 
rotation. Thus, *der tajrrsiehtlge m*sensehifti«r" ■#**•■ nearelghtod 
ecientist") woutd toe represented by Figure 4,4. The *aTO-«WP£RTY relation 
la special, in thft tha eelaction restriction far fta firet argtwent la fourtd 
associated with Its eecond. In the we* Is, tha reitHtt loft IUVIN6-THING la 
associated with MeMlaffCD. to our scientist would pee* tha teat. Instead 
of conventional ealtoctionei restrictions, therefore, **TO-#WrafY hae a 
procedure, which Is atttoaattcalty sweeiitad toy the restriction cheeking 
procedure. The restriction code far : Wm*mfWFt retrlevee the eeleetlon 
restriction froo Iti second arguesnt and uaee It to #arfora the check on ite 
first. 



pgg. 



fa 



«fr»VE-fflbP0lTY 

< t i * i (M i j ' > ' "w pff|f|WBf!f l f> 



HEAD 



H i t ii Li| i ||i i i i rr i ";'jij" ' 



il l i l i i M»f*—» «l— ■— ■— « 



OSS 



:eaCifiMTijSn[ 



.'-'tHWlit-W'-.t 'C I ' ■-:>• '!" I 



jasaa&t£HHD 

;3)WMia^a^awaWW^T^?^^e^»-.- 



Figure 4.4 
One issue in noun aotfifier represantation should toe Mentioned here. In 
Geraan, as in English, adjective* aay toe •tacked up in front of nouna in two 
uausi ssrially or in I parallel. The parallel version Ve the one that often hae 
a coftfta "die kielne, letohafte Redone" I" the eaa II, active eledone*). the 
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RSS2 
*HAYE-PR0PEi$Y 



HEAD 



IHAYE-PROPEflTY 



IB B 'f-t f " ' |j tfWl 



HEAD 






«ACTIVE 



an j gwj iluif jjj 



PS81 



fip*?**'6 



RSS2 
JHAXE^eaQEEBTY 



MEA0 



"•WWWBBWWSPWW 



n 



MPIMPMIPIPMM 



0SS2 

VARIABLE* XI 
C0NO|PTi« 



i " i w» 






RSS2 
AMAVE' 



mmmmmm 

HEAD 



^QPERTV 



•aSHwfw 



0SS1 

VARIABLE. XI 
CONCEPTj j»pn 



PSSi 

'Jlum i m i uMt 



reprooontation for thM it •tnalahMormi!*, ■»>*■*- 1 »#Hi*« ****!* Strli 
adjoctivo lists, on tho other hand, oom pro M— a X ray— n tatlon. ffea 
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Modifying relation* for soeething ♦Ww " -*der nette at to Harm" ("the nice old 
ban") could be rsp rsssn tsd byt 

dor (nette (site Mann)) 
That is, each adjective Modifies tbs entire reaainder of the phraee. The 
-correct reprseentat4*n« therefore, would eeea to be the* ehoun in Figure 4.6. 
This sort of - i o pre o ontst ton is accurate* feut -I think the proi iteration of 
OSS* e would cause a great deal of aK^a«ff fort ifar Mia generator. Another 
sort of I inkage would be necessary to rotate the different 0SS*s of a noun 
group, and rep r eeentet ions would he tsroer. To ovoid>this, the eye ten 
abbreviates the re s * oasn tat ion of Figure #.#af?^ill|i4^i^ the Modifier 
relation ••tebUehed between ed fee 1 1 ve end noun inetead of a new OSS. The 
reeult is ehown in Figure 4.7. 



»Havc"»fwenBnTY 



HEAD 



RSS1 
#HAVE-PROPERTY 



HEAO 



m* ili ii i > 



PSS2 

-■-mm 



OSS 



mm 



FSSi ■ 

piirir 



Figure 4.7 , 

This representation Might break down if Modifying relations theeselves have a 
lot of Modification, but this would only be happening for prenoelnal clauses, 
not 0HAVE-PRQPERTV rotations, and cleueee will probably hot be stacked up 
serially ears than two-deep* 



95 

I should note that only the eieplest sort of pd*P*t* vpe are currently 
handled bg the seaantic coaponent. For an analysis of aoee of the 
coep laxities involved in English adject I ve«, see Vendler (35). 

4,4.3 Coepound Nouns 

The Gerun coepound noun is often tranalajtep; b^a claeelf for pluiknoun In 
Englieh (for exaaple, "4I# Feueruehr" / "tba flr»,P>P^taafit"). Along the 
relatione that occur between the parte .of Garaen coepounds ; . are relations 

discussed for the genitive and aii«|M«>--PHPt.-«oa)l|4a>nVi f WrJ l, l!%<^i , ' ,r WW c *^HM -n 
deecribed in the laat two sections Je si to a«»llc^(# js bpr*.,^ith copppMnde, 
we also pet relations the*,, could,; be i; e^^.eaee4 t usJpg#^^i,tlpna # , auphapi 
die Guaaieiiefel « StioteheueJ^aaV 

rubber boots (eater lei of) 
die Trlnkglieer - G laser xue Trlnksn 

drinking Qlassss (use or fMOctien of) 
die Todssanzsige - .pine Anzeigs wagon des -Tpdjap 

death notice, obituary (occasion of) 
die Seereise - elne Reite euf oer See 
sea voyage (p laps of) 
Especially with the eore ceeaon i ep licit relations, mo would like to be 
able to handle coapounds that are net /in v the djctlpj^ary^bpt jheee;parte are. 
The system does this in ths routine- Sf1Qp||C|ie5. I«§ ppce^e.a^aipppMe to 
that for genitives, the saaantic^pf^ j|o argroup 

of possible relations, select ion restrie.tionipereUiing, of course. Tfie 
representation producsd looks either I Ike the output of the gpnitive routine 
or like representations of other npu^ppdl flora. This approach is dee irab I a 
becauae the representation reeeablee those built for slpMarptfMJcturee, the 



^^S^SSS^.*; 
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dictionary la not tosdsd wi th I eU ufi dfr t ttefmHtow, 'and the egatss is able to 
cope with new colour** -H-Mn-iiMriMkMr-lHfNiiik 

The draMbaek tyf uting the ssae i if te cooponsnt to >oopptu HsplicK relations 
froa a bu lit- in set is that ths range of re4«tiens aoaeiele between the 
coepeurrf's components is each witter then ths ■*m*-&'#>§WlHai*.'+*' *Y\*mr 
genmvas or sVsple noun eoditters. At'.sjis^s1fef , ^ls1JI'-H^- ; s^'4^'--iMt, and 
^thi«'iiD«i'ld'''«sajt : %slt>is^tilTi word ,pat**%ewfb* k mm ^lejjjfV&H relations that are 
coaptotely idto»9n©f«Wt:. it u a lmtm%^-**ftmm*&fal ; **i>«»->ees#©und»' it 
obV{oti«iu'iloei''no^^ie^«ng ■ »n'$ftWP6T#B» fn%^»taaW^O%e v .the place for a 
dictionary defthttten, end tn fete* fte.MBML' oaf litfUeA tecHI*y in the 
system coulti hsnOfe *tbe BltusHon with no trouble. mm& for. ■ ; the tsattteit 
relation could be built In the --defniltteff; #***!•«' { IW coeponente 

as participants, fhis is clear ly an of f i«*eht approach, as long as the systea 
has a definition for each i dtoayneratf c c O Mpum i J ■ ff * &&& &*. 

Between the cospouifce tors*^ 
i d i osynerat i c eoapOonds are a grdep that show seas fna#Bf arHtf , Si though the 
particular relations involved am unpredtetebfo. These are -given 
repressn tat ions by ths esoentlc eoeponsnt, but an HUNSPEC ffor ^unspecified") 
aarkar is used. Ths barker It di souses* bblow In section 4fl.l t and the 
eoepound class is investigated in sort data! I in section S. ; Si 

Finally, tnere era eeapetmds for uhlbh ths seen log «f ths whole is 
different froa the sua of the esenlnat of I ts parts. Consider "dsr 
TintenfiStth- ("the cuttlsfish* or "sboioV litarsrtly, "Ink fish"). Laaving 
aside the point that the squid is not a fish to a biologist, no note that "der 
Tintenf isch" rsfsrs hot just to any water eniaal that speus an inky cloud, but 
to the cuttlefish. If there is sees bther f ish-H1» creature that also spews 
Inx, it would not be designated by TtntsnfiSfth. Such s situation sees* to 



... J*/ . I^^jj^:^;^ P-,'-- <• „'".** 
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warrant a separate concept llkf *SQUIO. 7: and ao the n«^t|i»t|||^/frv..7r|W» 
type pf compound look* no different fro* that of e regular noun. 

Fcr representing compounds, then, ths »uf tea offers four ej terneti vest 
pre-packaged inpl ici t r«l«tlon«, special dictionary dji#4alJ4one f 
representat J ons that I uvi the re^aJW on unspecified Jar , #e> A'»* *•*!»»» «"** 
standard object definition* for coepoureW mop«« meaning* are mpre than the sum 
of the meanings of their parts. 

Thit last group of compounds motioned raises • qMaetion. Jo not a 1 1 
compounds, in fact, tend tp pa aero. WltnJtoa **** ^hjMr piirta? Who*, faced 
with "GuMMiatiefel" (yubber boptf"), HP^noj* mmlM9**lmfc tM# #**ej#l 
type of footwear, juet ae m have apecifM: JnfornplJsPni ej»pu£, "Hchnftr^Wafel" 
(literally "lacing bpote" - any boot the* hat « ehjaHaeo) ej|dsJ , |lpl J »aj5»H*te" 
Cclogt ") . Ue would want to,aeeooi«te *hie InformejMoni uBhr %AVftMle 
concept ear Ker, rather than the aore general concept efWiyEAR* Given the 
system ue have now, if *ha reproeontetl on for : '6im j m|ot lefei" wee supplied by. 
SrKBrFpUNQ, there J* nowhere j^out the ^foipwtlee^hfh^ ,M» a*f*oW«», but 
it is one that the sys tee does not have to face, si no* it ..If, rest 4#^aaded to 
do any learning (i.e. it is assumed thit the inf creation in a eentence would 
never he ueed to pe rma ne nt ly Gtienga--the-|(Bjtiuo*^:flleMti»taK 

If ue wanted to allow learning in tha ey^eev 4i»;«>B^ try/ th« following 
approach. Uhen a new compound ir encwimered, «he ImpUcit retoVien could be 
selected by SnpOnPpUW, but then in.tead of edding thie to the e»««ntic 
representation, we could create a new concept marker. For example, a new 
"Gummistiefel" concept would have /FOOTWEAR aa genua and eoaethlng like 
(M1ATER1AL-0F X *RU68ER) as differentia* Then any new^inforeation learned 
about rubber boots, e,g. that they are worn In, the rain,, jmu 1*1 be associated 
ui th the new concept. In addition, we c«wA$ *k* • djcjt Jorwiiahdof irM .t-injp, for 



cut 

Gum.)«ti«*in . m that the next um of '^h* l«erd wourdMnvofce We hew concept. 
It would o»ly be'Cttenoeicel to touHd ns*f oencopti, b^%e#se; ;> if^ they are 
real Hi ueef ul for ergaftHeihy i nf orsetion #h ths deduct I vs date bete. There 
uoui d therefore heve to be soeb cr^toHen for how eueh spec 1 *1 Inforast i on i e 
needed to Juettfy Hie tre*W<eft of » sepereto wincept barker. ; 1he creation of 
nou concept eerfcere Houtd etee effect oeneretren, etf m i would need e feci 1 1 ty 
to updete dictionary definitions for the generator iccordrhlly; 

i 

Si-nce, houover, we orb not trying to doser lbs % ft >e»iim K for "learning, 
there is r» need tagmrtt* nw owrcept lurfcor*. Thr «>e©r<»l»* used in the 
eyetea web .to gN* eonedpt- eorliere to coepound* that had • yobd deal of 
inferos Hon aoooeletsd with theb per 811 ** -like "dee W ot w i iy tee" f ' "-the ; 
norvowe •« •**»". Hbrd« f or uhich «■» bulk of •odtief kiforeet ion would - 
probata ^ •"countered i*i the li«jiit-tol»^fi^%he 1 ¥»i^'^aa - Uke "dee 
ChrowstopborsnspJeP #<*piay of the Ghroaatophoree* - e>* repr e >» (> te <l by 
chunks e*-eeeah**b repreesntstion <&&mm%- pdr^i^f^Mek ' ^'Thttoe'erb- " 
constructed by SttOnPOWC, «• dseor tbod etMrve, wrtbse brf Idlosyncretlc 
relation le involved. ?-' : . ' ■ : ' 



It is thus nsososscy to supply iepHc It ret eTtoftfe fcr genitives, 
adjectives* - and' t unuuonue tot tht noun ajHsvjMfc 'ertfer >%b dbvbiep 
represen«e%ienb -for the differe** ;; peee1M J ff ?♦#•!■"' 1rt*feneret, oniy the 
deductive oo epo n on t cen decide between the -eef "of . different representations 
produced* 

4.S Ubrdt without Ssaefttle Representations 
In the loot seetteh ye reeked at #e&«enb tJaft were hot trod to 
epeciflc word*, but how tho question is whether an entry In the ssaantic 
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representation must be formed for all words. In fact, n»t ail words add to 
the representation! generally, Hoi^ajt|ha^ dp^noj Pf jo^l^aaafrMtJCsCfRtant 
and vary hi gh in sunt act ic funct ion. Jh* jo*! Mff '<ff ae, pjf , £hj a la 
prepositions that mark part icjjpanta in f^aiJoni! Foj f ai^ai||»Ja, 

Dia Eledpne reaojer^aj^ den Bal?, , , , c ? 

The eledoru? reaoJteCJg, Jtha stimulus, 
Such prapoaitiona occur both with varbs and adjectives, Jp* only the former 
uill ba discussed hera. Obviously not all propositions are Of this type; 
those that mark location, time, c«iMlltj|^^tc^a^ 4 ||ia^lp.^lv)a^i!il 
semantic content and not dependent, except in the iwet^oaf^rjl^uay, on the 
particular verb used. The propositi^ 

tied to individual verbs and whose funcjk„|oai„ are .ne^fe^afd b^ cf aa in other 
situations: "air gtflllt W " versus^ "Job f reue pf^djB^||r." jpjerman? 
(roughly, "I 1 1 ke &" and ■ I am happy f^yT^ le^l^versHa p^ajp^ in 
Engiiah. Uhere auch prspoei tlpne . pccur^ tbj^r r ee|an£tcfc }e.-eeeaf t|fl,lif • no- 
op, i.e. the seaantlcs of thf preDeaJJion^ 

marked by the preposition ries». When, the relation M»vclatfd with the ..jm* >• 
evaluated in STCLAUSE, a list of its reo^ired prepositions is retrieved from 
the collocations list. These preposjyon| r ar^ipdaaad by Spaaqtic definition 
labels, since a difference in preposillj^^ in 

semantics ("dlent els" / "serves as" vsrsus fdiei|| asu| fi / "?#/«•* *o"K For 
each definition, then, ue know exactly' whtrs ...to Joofylp thf afffntlce of ita 
participants. In the case of a required ^j^ejBoslt^on^Mg |ysj pick up the 
object's semantics from the registsr thft #J f •«*. J?tf tp*\ jaaanjtlal ly the 
same procedure is folloued t for ssparab|fpxf*lJ?f»« |l|Cf a^feoa^ab^e R*** 1 * 
verb is considered to be one word. This^unjjf^altyjf desirable in light of 
the close relation between prepositions and separable prefixes. 
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It is not, perhapsV entirety : *«»r to t1ai* that prepositions that mark 
objects of the v^b are devcHd tff eeirantlc ^ebn^enfF In"fafct, twal aurprieed 
at the regularities that I mrc<*jntered In tlSe course o* organizing the 
selection restriction tree; f^i^ilo^*r cl^Hhe^^'i^,' of couree, 1 s partly 
based on this sort of regularity^ A* en sfcempis, ''i$ft\£f]e&'<M'i"Bttntftl 
process (Including in this perception! i» '-ot^Mtftld 6y : M"' 
ta) Ich d\M*e"bft dirM. 
I thin* M. It' Of tan. 
(b) tch errlhere alch dariou 
I remember it. '" ' 

(cf Oas ! istf ifi Sepia zu beobachtan. 
That c»n oVoeaefved jn. ^tiv v 
In a sense, thesa^raguiarittes ara not surprising, since people have to 
remember uh I ch prepositions go t»ith Unleh verbs, and ins more regularities, 
the better; On tfMr other hahoV the ; situation J* complicated fey the fact that 
the same preposition mag be used with a rather wide variety of vef^e in a 
number of different wags. Thus we ha»s for auf» 
(d) Er wfrRt aii das PubMuaL 
Ha had an effect oh the publ 1c. 
(i) £r reioTert nichrda^auf; 3 

Hi did nor react 1ft it. 
(ft Hfr Habeh atundenlano auf Ifth gaJiarlat. 
Me uarted Iflaf him for hours. 
The semantics Bf these re$uii*ea I prebositions, then, can give a clue to the 
relattort betieeh tobjact and verb object^ ' but "tnf Iwiotil edge is rmvdr 3g 
definitive without the evidence" provided by the verb. 
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4.6 Idiomatic and Special IfcjfjM,.;... 

Another situation in Hhich tht words in a clauee |J,ght fe ngJ map one-to-one 
onto the semantic represented on it when Idiom* fra praaajt. 4 jp uelhfl 
"idiom" hero in the aaaantlc etnm, to aean any phraaewhoee *a*nTn« M 
different from the aua of the meaning* of its component jwo^ 
repreeentatlon for an idiom, then, ^Pn^^^ffff^^^l^,^^^ 
itself. In the syatea, the most general uay to handle an idiot it with a 
SPECIAL definition. For something likei 

Dae Ei sen Schmieden eolange ea heiea ist. 

Strike while the iron is hot. 
we could write a SPECIAL definition for echmleden ("strike") that would check 
for the presence of the rest of the phrase, th«p pro|||^ |.^ aaa^tJc 
representation to embody a aaanlng like "Act while there la an ofportuni^." 
Note that the system does not deal with such extensive idioms right now, , 
because none are present in the saap I a paragraph. „„ 

The system does handle acre restricted idiomatic usagea In two ways: 
through the collocations list and through the selector aechenlee. The 
selector was mentioned above in section 4.2.1. I ts pif pj^,!^ to croaa^ 
reference seeantic definitions with syntactic features. A^lac^l I ty not 
mentioned above is the appearance of a Mcrd, ? as ^wslj ai i.a ! ffajtjr|i,.,ae 
selector. In such a cass. ths seaantic definition if applicable only if |ha 
next word in the sentence matches this word. 

The word selector facility is obyiousl^liaitad,, JK*$ it wou^d notbf 
included in the systea if it did not coae ,.fJ!f^*|f\.J^"| o r ? 4?t^lV! j^Pr. 
collocations list is potential ly of greatej ,-%**?* ] *M% h ?X***^*}0*% *W »t 
is used only for associating prepositions with ^verbs, as^descr,Jb^l ^.n thf, !••* 
section. In the collocations list, ws can index the aeanina of, a verb by a 
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prepo.it ion or aepartblo prefix ifHriihtin the eorrtoneo. It Mould not bo 
•iimcort to .xttnd tftte aoehmiaa to haadfo idloaeHka our -Strike «hil. tho 
lrw» It not- oxo«*ro, dnd f auepecf that thia Mould be a good Mey to procood. 
Urttm d#ICf* definition for every Idloa that »e uant to odd to th. eyetea 
Moufd bo tie* e«Mf «f the coo. Mould I be rob*m»oua. Tho boot policy 
.<•%• to bo to uo» M* definition, apaHnyly. •** to uoo tho ^location 
tfbchanlea to rofloet tho regyforitioo that eon bo found. 

4.7 Special Eritrioa in tho Seeantlc flepreeentation 
Sovorol apeetai concept earkere are need I in t^ oaoantlc repreaentatjon 
ae it ia built up. Theoe ^ ti'vmm* Mimim"**^!*;, #UNSPK, 
«W«0U«O. arW JnttfttNt. Th. fir*. WORTHING, help, roprooont noun, that 
neae o participant In a rotation, and it* uoo "moo Ylluetrated in section 
4.2.2. tho thrwothor aeteconeepte, M U ili oil have been roplacod by tho 
tiae that the fipnorotor oof a tho .pontic roproeorttetlon. aro tho aub)ect of 
thia aection. 

4.7.1 Tho /UNBOUND flag 

Th* MJWOK) earlier la o teaporary placeholder which diaappaaro by the 

t.ao tho aeaatffic coap**nt finiahee it. uerkVlho aurpo*. of thia earker i. 

to arfoM evaluation of relation. before ail their^ticlpant. have been 
bound. Thia ia not in any Moy a theoretical naceeeTtyV alnce mo con elueye 
uattumHat. tt» pWUcfpenta are in befori tvaluatlno^Wotion. I find 

it a oatiefietory oetttlion, however, beeauee it keepe the ;"aaHntle cooponopt 
fairly bodtfWt that la, the **M*to aoc^l* •? ,oue w auch procoeeing a» 
pdotible to happen la ebon a. a'ph^'haa'baen'paraod^ " 

Air ari OKaapto, contldor the ptraeo # dle ia Wveneyetea varlaufandon = 
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ErreBungen" ("the excitations thfV^Mft TVHm&'^-IW^^dN****** 
I Harem*, "th« through the r>erv©ut aueMe rawing «*HeUons"). Stnoe we 
want to handle the pronoMlnal ot*w» "lal fc i HWUKaU w »<rlaMf<n<<iv"^— aoon •• 
i t i • parted, SttXMJBE proouoee the reereeentaAion i* f fcgure 4.» (end Maybe 
others, of course, fort the eltarnatlvs i«tere#*taUcntl* MoW tnet ihe . : 
pr epo* i t i oo* I . , ptaaee " Im.tk^^m^f^i^'* also •NwMNr^P.lii 4MllWiV ?*«t 
tho proceattag for tl»e prenoMtnal clause bat already de«a the binding there. 
The representation shown is laft at tha clautt node, wtwrt It otta MntM tha 
main noun luxtauogan •• P**eed and S0NC1 it activated. SNN61 calla tha 
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HIT* 



MJNBGUND 



INBlWBS-SVSTEfl 



,-.. Flours M> 
SftlOOIFIERS routino, Mhich it, as its nam lap lies, a general cl 



ing house 



for Modifiers of nouns, verbs, adjectives, and the reet. SlfCOIFIERS takes 
note of the fact that the Modifier it a prenoainal clause, so it knowe that an 



0UNBOUND needs to be replaced by the CBS for the noun. Swing thie, it calla 



tmite. 



REBINO to Make a selection rettrictiont check, do the binding, and supervise 
any renaaing that is necessary to ksep the tsaentic representation consistent. 

Besides prenoainal clausee, the flbliOtJNb Mechanisa is used for 
preposition groups (since prepositions are general ly represented by two-place 
relations) , for subordinate clauses of various ''sorta^ind for thoee adjective 



groups that are represented •• rs I atlonsr Although In 
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all 
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constituenlfe *bt «or^ittliint»<^ : «i»tr«1Mll^^ft 1 NlJtti'Min ported, 4Mhi«4 ly 
not ell will have been s»sluoted ■sssnttetTty ffsr swsepfs, tbs daftf ¥Mrt>) . 
Therefore, ^ ttw w of tht wmB aton e * f so iW>pmm& i \*'&m? coses, and 
to keep th*ngs aatffora, I Hwt«wi41 Wr< a g r» a ul , $^*a>t ! d* feature is 
probfbtg alee essful \* ! totsrpwttno, frietHh : fosses lug mpff-^tiftAr*!- «ttor high 
4owe4 oacVeiem era also k«pt). - Ths a»e«fcsrt*hi*e of ^sraen %yrtfe* " ^ ' 
iareiaaitael clauses, swd o rsa r ■ *srtie} .< Iwww , ad k s f mom a»W W^iartfldl 
binding MOhani»a«s»**t IS I. 

C 

4.7.2 Ths JUN6FEC Itarker 

KJUNSPEC, for "unspecified** ■ ■'*• J«'%e^ hatch for ths saaantic 
caaponant; it it a placebo I der **r seas of ths inforaetion that is left 
understood 'm the « utterance. Sees JMbjHX -esrksrs aay bs replaced whan noun 
group reference is dotsreinod, but ether* j are- Jpft J^ ! %ejefcductive component 
to auli ovar. Im aics^ia aliM is ussd 

for. One place far this sort of earfcar <'\» in relaUenal nouns. Of tan tiaa Is 
left for ths reader to fill In. ss In 

(a) Karl errinorto alcb an dot ftsnnsn. 
Karl raeeabsred ths rata. 

<b) Karl f routs tich suf dss Rsnnsn. 
Karl looked fortord to ths rac*. 
In (a) . ths raca jradataa Kar I ' s aontsl aft i en, in 4b) Kar I ' s aenta I proeaaa 
coaas first. Further, for aany usat of rs let ional nouns, soaa participant is 
left out, frequently ths agent. In a lot of eases, ths agent is understood to 
be the uniwsrsal anyons, for sxaopls, "SchJ leufsn ksnn gofahrllch agin" 
("Ski ing can be dangerous" , l.s. , anyone uhe skis can find it dongsrous). Not 
al I such constructs iaply "anyons," housvsr. Soaa agents can be unique, as in 
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"der Erfinder der Buchdruckerkunst" :. I'thi. Jjw^^j »f 1 fH|isWfl^"1^ iihj lo poet 
understood agents can 00J41 b« ^tp^i^^{^^H^^tt^4|p||tfwM "<*•• 
Abschelten des Straw" <"tbe„cutt*nB gff qf Jfce ^w^^^qflll^ L ^PH| bV • 
hoeeoMner. acoepaou., or •-r(f|l^ Vr |^ f^^ HB^J B f 1 *^^^'- V*"* 
to defer the dfcUloo until tore Jnfp£ae^q|| Ja, fjffl^j^j^^:.*^.,^*" 
ttJNSPEjp i» ^en.tlW.f > iMti.9«lj^J^^. : ..l|f •ffR| 3 ^S B r, f |*r*f l, t l, ! r ^ 
abgeschaltet" / "The power if bp.lm^l qff'l, r Ue e^t^ht^ae^fnge pi ,; 
possibi I i ties here a. in the re lit iona I pp^|^^^e^.,fj|nt ? , 

4.7.3 KJNSPEC for Ellipsis 

. The MJHSPEC uses •boye_«re.toei,^l|u «^$«^M^>^e a lf!$ < 9 M l In eoee other . 
situations Hhere infareation it left out t there eeffe to be a lexical basis 
for the deletion. In our paragraph, for eHSeple^tbere Is a discussion of 
whether octopuses can perceive color, ^ : After.,pWlo| ; ^J^ce ii Jhat eupjjorte the 
existsnca of color perception, the author ,§1^1^".,.^ ecJ^Mnt.das ■&*,.,, . 
eindeeten for dieeo forearLj(^„ejyp«j|, 

it least for these spfcjej, f«f*», tp. f^ell^l^SVlle^Jlllppris.. .Hfc^tftP^eTithw 
has substitute a noun group for a rsJai]o^ a »uc|i ae^^e |f||teoce Pf" Blue 
the noun group. It seefa to ,pe that abbreyi|||o^ ykf^|| deefof, ver^euch 
pn the special sense of oerticujir werdjy»i|h possibly, toaf iprouplng liito 
clasaej of Horde that el low ejiiler t«pe| 8 pt eJJjDe||) ?Si |p pf^aaplp^ the 
ellipeis eight J* triggered bit" ipr ec^,|fr",(feu0^rj. For^bjs^qaeee. 
then, it will be up to a SPEJJIAI. dffinJiJon,rou|Jne 9 tp Ja*rpdMce thf AUBPEC 
earker and do the necessary binding, Dedj^pj^cafijhen decide uhf^e^ation 
is understood. This def I nit I on approach i gmi*^>8}9^^; 9 lie6^>f&-X^ 19 
spec i a I cases • and knoM,, ubat , 1 1 neede to to ind f or^fejajih-pef £ij^e$ case. ,. , 
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4.7.4 The *H*FGR@fr Barker f or Pronoun* 

The /tdW-BRKNT **rk«r is uMd for third person pronouns, since that* have 
no concept »«r*ei»* to call their own. ThU r«fl»ct« th« fact that for the 
•eeanttc* of a pronoun, tie are total fy dependent o^ JJMiriitff Idti free tha 
coreterent noun group for on our knowledge** the 'loiiair ^iferaht, a* for the 
f iret and aeeond per ton) . The HfcftHEMVtrtrMr !• suppl isd by pronoun 
definitional and it Is •ieirar to IUNB0UK) in that Ft ha* been rep laced by tha 
tine the •eaantic repreeentatTohi reach 'thi deductive coaponent. The 
aechanisa for handling JWEFERENT is also functionally eiailar to that for 
^UNBOUND, in that the scee sort of r*binding is dons. 

Let us took at the use of ths <W3 : 0ENT worker In acre detail. First, 
for things likepereowil and rslstivsVonouns, ila^HENT eight be replaced 
almost ieeediatsly. SHNG1 causes ova luatl m of the pronoun definition, 
setting up tha ' "0tttM8^Sft Barker. S0NG2 probably mII I not be cal Isd, as ue 
generally will not have quail fieri fol I outng thesa pronouns. SHPftON is then 
called to hand I a refsrenc*. Its Job ir to construct e list of possible 
referents (using heuristics taken with II tt Is Changs froa Ulnogr ad's sys tea) 
and to eliainat* those that do not agree u I th^y^ pronoun syntactically (on 
the basis of genler and nubber). IntoMMfeNT' l^^1* : 'ithen f: ^ebour^ ; ' : to^each of 
these pee* i bio referent*. As seaantlc procetstrig continues, soee of these 
will probably be sllainated by selection restriction Checks, and th* final 
eorefarenee deci* ion* Hill bo aad* by tha daductiva coaponent. 

Froa what ha* been Mid so far, the r**d*r eight conclude that the 
JUREFERENT Marker I* not necedfary In every case. It Is true that if ths 
pronoun 088 is rebound i abed let* ly, then thi ear kef I* en oktra *t*jpu Even 
perebnaf pronouns, however, can aake forward r efe re nces , >^i eh are enough to 
justify the earker. Consider ths exaaplst 
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Ehe gr. ieaetrikuJieren Mnji, m*M t Jf$^^^}»0f^m^r^fW9 jpsfP"- 

Before Jig pan enroll, Jaujflfl **• *Q take tne ent^apc* •*#»*, ,.• 
Here, the 0REFERENT earkar it inserted for jr^ ? *ad J,*^!^ rjab^ur^g^tJie 
OSS for JfltMOQ MntjJ StlCWUSE |t J^M^fajr^^af^ ^cJUpue^.A^Jher^ 
situation where the f?EFJ58EHT, JiarHa^ la^wfjLltLf^,!! the jfc-coeppuooV »fb£re 
forward referencf is frequent, In tM)f|^sf^|>|j| JPf!ri99EIP l ^J H ^il!^T4/l^ ■ > -- ^ ■ 

...was abtnfalLi tehr sti|£Q^ ijiiMcbJW **M d^ese, Ti|f4 F^^ ru 

unterscheiden varpogan 
...which again vsrg strong^ to 

distinguish colors _ ^v-..,^- ,«.-.- - v ,, t ; -...;■ -^ :-'..;.-.,. ■•>-.*: 

Hare, dj,ffic. refer a f oruard to Jhg J^dass^lausa,, ^or ; |h^._aort|,o^^onfluna. 
the ^REFERENT aarker freg*en|ly raeejna M b oni : ^?^)|||' l l % t fFfiF"-MlU^.A B i tl1 
tha end of tha ,fan|ancf ?< u#i t ing for the, ^fjE,erp|,|g. ^fogqg^ . , . _ . . rt 

Uhan ttw BR&^J myrkpr Jsreb^oo^ *9J*"l :WPUW* "^V^W*?™ 11 
coreference. Since coreference in fu| I rjoun «greuj>| |Ul If^f^ - *?^*! ,M k * 
pronoun coreference, both ..are dif^taed JE||||thjr 5 |o |bj ne^| r ae^|tio^. 

4.8. Represe^|lng ^ejerjnpt r 
It aeeea desirable for prcwwnj a^dfllqitf nou^jrpyfie4«^« profcarlu, 

for a| I noun group* that ere coroner en|,ujtb „o*b^ tbji.te>ft) to 

have aieilar semantic rejoresenta^iooe, ,^« the tj^a W» f pner,tter sees th« 
representation, there will be no e*tpl|c)tio|»le|j||f^ f&^.t^ «&MHT>t*P* 
structurr contained a pronoun or a ful ,1 < oj^ej^. av {ty g f a»^ v ft»r^^lficai 
the target language ha* its own rules fr i &n%1^* i ]QWWPW'> a &-V'W un 
insertion, and these eag or aay not coim:id« with the rules in th^taurce 
language. In this section, I ui I J dlacusf f inding Jh|jor|f^e^rH^group f 
the representation built, and !•*«•• of pae^dOTCOFf ff^ce^ . Although y*l 1 1 
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speak of a WWW ija^-'Vefirriifca** *o aliither niun gVouirt reel ly Mean that 
the two aVe 4 cWtferihf;' ; 0Wm «xtri-I^Wfl<r^tFc thlrtgt (or quoted uorde or 
pnridee* Wfc 1 1 6gdr»t Fc dl«^^iW) art »fctoa I ly^ferf'W to bg noun groups. 
Note th^iW#an*ee WaA bi tleo% ta dtnar e tta4^iM^a by uiing pronouns Hke 
m #&'-Wkr' Wr «h taction MtO« ! ^f^^MI^ dn ndun group - 
coreferentfe, but 'th* sHuatiOrt li »«*» the iae* wnen * relation i» involved. 

For a pt-widuh, ^wi Ifftf that #£ eaeantlc eoapoftent eceueu rates a list of 
possible eoreferent atructures. For full iib# d §*ii^'' si ^ r yftuatton is 
e light I y dl f ferent, and the epproaeh Otitl ihW-her. feltoM. Ulnoorad. Pronouns 
are eo weekly specified seaanticeily that they eannof be' separated froa their 
rWfere^tiby agraat diitertce. Fuf! :; 'nou^'"ira^VlNd>aWar better 

spetlf f«d, and 4 r*f*rtrtt c^ldpotsnllillfi be found artywhere In the text. 
Uhile I suadffct thlV reference! outaidl a^pafairaB^^i ffaftad to cartaTn key 
Wtiti p/dupft, I alee think that doterirning tnis* noun groups Is non-trivial. 
I.e. hot oo^ioU'i frei tuffabe structure 'in'o^ery cess. " ; &> to proceed for full 
noun group* a* h« did for p>ohbuhs > cMstMifrtg i pdeiibifrtiea list and 
narrowing it - will not be feasible. Even if tie were to Halt our search for 
referents to the scope of a peraoreph, our bottlbi I itlss list would not be 
very interest ^^sl-hc% we Havl nb ''good' way to nerfw down the possibi I i ties. 
In so** artoattdhs u# sight be eblfc to use sa?e£tion r§W i£tioWe to narrow 
P08«ibnitl« 8 , btit In general they »m i^t W adequate. Furthermore, 
ddrWfer ? eht fult nooh gt*oobi do net 'agree tilth the If- referents fn gender and 
number, we would ha vs a potent Ta I ly outky poiiVBi 1*1 til* I faf'lhd rtothlng to 
de «i th i t. For WVs rdatoh, f t ts lift ta tjhd detefvl l eoaponent to 
ae^eiraihe which full n^uh o^oubs ari cbi^ef#ent. ^fthdugfi the deductive, 
rather thdrt th* seafhtib coatfoniht, wtll b§ adding dereference Inforeation to 
the tfeetafttlc repres^titfeh for fuM noUn grbUpo, I would Itke to finish up 
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the discussion of corefecence atthla tiwm* 

Ths Main festurs of ths rspesssnta«oO *i that* cent fe*ehtt tteas are given 
tho saee variable (register setting VARIABLE «K at though* their *98 Mm- RSS 
naees differ. Pronoun sseaotie nodss «4ll3e*ea«*its*i «hf*P €9WiPT Mtttng 
from the coreferent noun group «ne»>#€FSRBilT if nebeiadv *; #u#* noun group, 
on the other hand, always suppiiss Hs am concept esneees ; <6iven a variable 
nans alone, it is netjel*aus sasyctO' f t-ee^tbaou saaa^ tsa^nodeasaMrtWB the 
sane variable, bo ths rsgitter GQREF i». tet to a ti«t of sH the cor-e«eeents 
of this node. That 'IB* ahen ho <- Ufisb**jeoji re«eeencev»Me*sst the KJnW 
register both on tho rsfsrritig soMantie aadaaand^esmtha node nesenedtto* >Thlo 
will be. useful for generation, sines § bags reference b* one language atgttt be 
bettor translatsd as a forward *ref#rsaca^iiMM»etl»ar* : b*i»a* «<9fehcws ssojs of 
the inforaatioo that »Ui m resent 4Bftt|^^ 
groups « "the -eophalopod? ; and "this an|aal % ^ ^ ^ a- -u > 



OSSNAflE- OSSl 
CONCEPT- #CEPHAU0POD 
VARIABLE- XI 
COBEE. (0SS21 .... 
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CONCEPT- BANIrtAL 
VARIAB LE- XI 
CQBEF-,*08S1) mt 



so 



Note that other registers ars ss| ••©*?•*• I B *oe «era)B*rs«* ueeaantle 
that thou nay, and often ui 1 1, differ in BBBlFtEBs, dRt VEN >CM, otc. 

There is another pjtanoeenon that osnavs«:e»|ch tW* cereference* but which 
ho Might cai I p*eudo-cerefereoee. ConsWer the esaapl mi .■-;;< 

Anna benoUte das grosse I46r^ert**ehqundfi^ hJftiJBft' 

Anna used the big dictionary and I ussd tttt Hill t/M ■ 
The noun group underlined in ths Geraan is elliptic for "das kloine 
USrterbuch" ("the saall dictionary"), and ue can supply tho Main noun by 
looking at a noun group earlisr in ths sentence, "das greese Uorterbuch" ("the 



b i g d t c 1 1 onary " ) . Ue eight th Into e# the? •esatxt noun group as aatU ng a back 
reference t o " da» gnaas* W*r ttrboeb* " «m«i thtt 14 ee*«»a to Ida ganw-a I 
claae "Uorfcereucn" ("etictio«a»'tf , 5K r *»*»«• tha» to tl^aore raitrietad elaaa 
Ma»f«>dj^*§na*g»? an*thea*aajaa» no are, no* rooitg d ea ling with conference, 
btt* Milh ■ an abbreviated uau of ctt»tineu4*h*oe> tu» 00 1 i i» but ra lated - 
objects* Ue ropeaaent the .-too* -uUk dHferen* vartab***, but atab not* that 
the abbr^at^haaibeawuaed ^^ both noun 

group*. Th*a regieter la alaa «»ad to raapaaaM aara**a I Ua anhfb I tad by 
conjwnetiena* *I»> cue me a oi a ■ ■wU o co above, the WS's for the two conjoined 
clauMB uwld afto ba •ar^ad t :parailai^ nWAUAJ-lt aiao used for parallel 
**ln clauses, •*»** baaaaeted by a s eal - to longer In oaoerats sentences. Tha 
baaic tdea ie the* thU»b*t**e*le» «*HlkWmr1% nit*reaf ly language 
independent^ since eony languages M eh t hove different rtMee f or con junCtl ona 
or noun group pseudo-corefarsnce. Houevar, tha- a •**♦!»« f»ARHJ^L3 rag i star 
saves tine «pant in «atiing cooparttoaa In thf 



4.9 Saaantle RoprasonUtion far TheaitJc •fabturb* 
Thenatic suetaa* repreaent thsrergenlzation of m utterance aa a 
neeeege. ; ■******? tnayidro^hot restricted to hsedNedurse love I - Him*'. 
linguistic organization above the Mntanea). In faat, thobaUc eyeteae can be 
found at the uorcv i*cup k end clause level • of • eueteelc graaaar. * the 
distinctions In thfce * sect k« era sd s ptsd ■ frea Hal tidey' 9 work, hopeful lg mi th 
the or.i«lfaUb mtentlono Intact. Reader* ulohtag to fudoe for theaeblves ar* 
referred to Hal Hda^Uaail* 
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4.9.1 Uhat's in • Semantic Representation - Revieited 

,,;■■ 'j, ,{ : . . -/, -<■ : .-f ■, -:\ -i " ; ,i,..^ i »»j^- : . "■;""■ ;■ ,' *•■ "I; ■.'{'try;--- 1 "" '■' "■ ' • ' 

Before considering thaaatic phe^oaena in ^tfi|^l«t,u» return *o the 
question "What should a aaaantic repreeentitiqn rapreeent?" Ma can divide tha 
inforaation in a aaaantic noda <aee section 4.3) into three claeeee, onq of 
which uill be thematic 

Firat, tha aaaantic rapreaantation contalna Inforaation from the 
propoaitional level - eore or lees. Strictly _a||^|w^J, ; ^W o«MJdar_th« 
propoaitional level to ba ran knowledge - eeeentlce ainua tha thejiatic 
system.. But, •• ua uill aaa belou, tha objact-ralation-proDartu distinction 
can ba callad thaaatic. To call tha propoaitional level semantics ainua 
thaaatic Inforaation, than, aeens that; it f ls ,« •*^eeei| 4™ : , • v • , ° f 
organization. Lat aa tharafora quail fg tha or ||Jna^ "t^aaant end .ay that 
the aaaantic rapreaantation contalna Information froa thaj»ropoaUlonal level, 
augaented bg the objact-relatipn-propartg d^at!nction. In th I a category. I 
uou Id place the regiatara CONCEPT, CASE, LINKAGE, VARIABLE^ and Hj^E^CE- 

SCOPE. _.^ t ,.. . Al£ .„ <! . ,.,,.. 

A aacond Kind of inforaation in the aaaantic rapr«»wtation relataa tha 
aurface atructure of an utterance to thia aaai-propaaitlpnal lavel. For thie, 
ue have the registers TYPE, ORDER, RESTRICTIONS, 08SN0DE, R8SN00E, PSSN00E. 
and PARSENOOE. This is inforaation that ia wefful In^dfCi^ing Hhafhar a 

rapreaantation ..l« ,*P? p ?» r A*8»' xjff'f i^e^g^W?^ 1 f S^Rf 1^1^ 

rapreaentat I one, and keeping track^f thj ^ f |ha q eea|n| jc reor«aen *a* ion 

correaponds to the egntactic atructure that |s ^log^Ut bj the p^eer. 

The rest of *he inforaation in t,ha aaaantic rapraMntation it thematic. 
Thia includes ANNOTATIONS^ GIVENtNE|, ^COJEF, PARp||, jNFO^PROER, OA^SE- 
TYPE, HQOIFIERS, T«n|, and RE^TR?CTjp|SCRfi|, . Jhe raa^indarqf^^ section 
uill be devoted to theee theeatic categprjfa. _ 
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4.9-.Z Thematic Orflani ration Be I oh th« Clause L«v» I 

Hoot thematic Information seems to be related* to the clause; the group 
level thematic features that utll be discussed here are also found at the 
clause level. One sue tea discussed f' In Ih is section -information focus - is a 
discourse system* and should property be represented aeeve the sentence level. 
Sine*, however, its manifeetetione are seen at "the group and clause Isvsls, it 
ul II be repre*snte^ af the rllevertt group and clause semantic nodes. 

A thematic phenomenon at the word level ts connotation, related ae it ia 
to the tjweker'e choice between different ways of express log the eaae concept. 

Looking next at the group level, the head / modifiers linkage eeeme to be 
thematic. Consider, for example, the difference between the t no noun group si 

fl) der btaue Hlamel / tht blue sky 

(ii) die Bllfeie des Mimas Is / the blue of the sky 
Ih the semantic representation, the only difference between them Is the 
rtuDirraiB register. 

Another group level thematic feature is ths cHetinctloh between objects, 
relations and properties. Objects and relatione can both be seen at bundles 
of properties! in defining them, we are making a commitment to a coherent 
world view, i.e. to some sort of "Identity* in the eeee'of objects and to the 
assumption of ^elatedness* instead I of randsensss In the cats of relations. 
This conceptual leap of faith doei not tsee to be "s conscious choice on the 
part of m Indlvldusl speaker, but rather a choice that Is built Into the 
language. The reason t sag language 'here theteei^cwieepWr structure ie 
that it ie posslbts, for exempts, that the concept of 'objecthood" differs 
fro* language to rengueye, ai Uhorf contshds in his analysis of Hop) (36). 
Niver the I eis, for the translating systsm, the object, relation and property 
distinction is expected to be m I ntalned in ths deductive data base. 
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This it because the distinction seems to bo integral to both Engl leh and 



German. If, in fact, languages do diati nguiah objects, ra I at ions, and 
propartlaa in different ways, and if this.raflacte deep conceptual djfferencae 
as well, than tha conceptual ^raaj^an^on^aai^)^^ heavily language 
dapandant in this rsspoct. 

Finally, tha diacourss catagorias "given" and "nan," which belong to what 
Halliday calls the information systea. alnaaaear — a grots level thematic 
feature. Since this systea is realised primarily pbonplogjcelly. only certain 
aspects of information organization are going to, be relevant. GIVEN 
information is that which tha speaker (writer) thinks thejiejener Jreeder) 
can deduce, either because it has been stated e»«IJpltly or, because it. ie 
common knowledge or because it is ^ in soae f se^unj||i| f a^c.^JfU InforiaBtlon, 
on the other hand^ .is the^rsason far tha,^J^a y a|^giea tB 4 4 f % Jha . ;j . 
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information which ho wishes to coaeunicste to tho reader. 

One place where the given-osu distinction it reflected in text is in 
definite and indefinite determiners. The d if terencs between the noun groups 
underlined 

(I) Ella BiiCh ist verschwunden. / Ihj. bjMk. hes disappeared. 

C 1 1 » Eln Bush let versehwwnden. / A IBBA hat dleappeered. 
ie that in U), the reader is expected to knew, or very «oon find out. which 
book ie Meant, white in (II), he it not. Ail seaantic nodes correeponding to 
noun groups, then, are wafked by the transiting suttee with either GIVEN or 
NEU. Note that this ; Means that aoaa retejjji nd properties are also Marked 
along the way, i.e. these that ars expreeaad jm noun groups j "dae Scbwieeen" 
/ "ewiMPing". "d+e ilftos" / "the bluenete." Pronouns, of ©puree, ar% 
autOMatically GIVEN, since their referent ie always expected to be derivable, 
either when they appear (for back reference) or as soon as aore of the 
sentence has been processed (for forward reference). 

Another piece that the information focus eyetee is reflected In text 
(at though i t is not now handled) , is shewn in Ins Veil towing sxaaptei 

(i? der oiaue Klotz / the blue block 

(ii) dor Ktot*. dor bleu ist / the block that' la' blue' 
One Important difference between these two is that the" entire noun group in 
the first example oust be GIVENi while in the second, the subordinate clause 
allows "blue* to be NEU, even though the rest of the noun group is GIVEN. 
At the word level, then, tnere is connotation, and at the group level 
there »% the distinctions between head and Modifiers and between objects, 
relatione, and properties. Thess two group level distinctions also appear at 
the clause level. Finelly, while the i nforeat ion focus systeeie properly 
discourse level, the given-new distinction Is repreeented both at the group 
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and at tha clause lavel. Nota that two other registers in tha seaantic 



representation related to information focua 9f CQPF and PARALLELS, which 

.. ■.-,..-■-..■■ -•■'.•■■:<'.•- ■.,.-( -:. . •■■■•:. ■:■•;.' ■■'•>?■':■:.■; *£' s.. : *Sf**-| $V.v': I,*?.*:* ? *^s..' ; '-"- 



uere diecueeed in section 4.8 above. 



4.3.3 Theaatic Organization at tha Clauaa Laval 

An iaportant clauaa lavel theaatic systee.ie the t h eae rheae 
dietinction. In terae of Hat I iday'e definition, the thsee if the firat 
constituent of the clause. In the translating eyetea. thaeafore. the theae >• 
narked bg the syntactic eoaponent when the clauaa if Pftaed, but It is also 
given a eee^tic representation. In semantics, the tha** rajiatarpn the 
clauee RSS is set to the eeaantic node associated with the thee*. Ha I li day 
has characterized the theae as "the peg on which ths^saege.ls hung" , fg 
(le.p.lSl). The rheae is the rest of Sha.cjauee., |^.|ai^.o^th^^^fe^Bj«tlwi 
structurs, the theae is often GIVEN, as in: . ,. . 

Thie is not eluaye the ceee. however, a# *bQMnby thje^eanjancp froa t.b« 
sample paragraph: ., ^. 4 

Nach von Hess, sollen sie slch wis der ferbenbiinde llensch verhf Jtffi. 

According to von Hees. thsir (ths caa^iopo|^ ^b«t of a 

color-blind aan. 
Froa the reader's viewpoint, theae acts as a set of direct jone for 
interpreting the inforaation in the eentence. Uhen the theae U G)VEN, the 
writer is saying, "Here is a concept with whjch you ars feel liar, on which you 
can hang the inforaation I aa going to give you." On the other hand, when 
theae is NEW, the, writer is •e.ttjng f |ha fean|, ( gJv|no^i0fog|ift|aB ha,ccfiaidara 
helpful or essential to interprfliflfyha^ will be ^^aa^d^^ba r§#i of ,tha, 
eentence* In the e>caaple given, it _i* ;i j|«>or|ant.#or,^h^^au|JhBr,tf qualify hie 
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stetseent by ettributing it te von Hess. In sssenee, hs it saying, "Don't 
assuee I believe Hhat i m going to toil you." in tbo exaaple sentence, thlo 
qualification is aiso expressed lexically by the uso of the vorb fioJlftD. 
("supposed to be"). In fact, the author goes on to give evidence that soae 
squids do pores ivo color. 

Another theaatic Mechanise thai relates to what the reader is expected to 
do with the inf creation he Is given is Mhet I will call the rsstrict-^describe 
distinct Ion. tieeieaUy, a head-sod Ifl or type relation is RESTRICT if the 
Modifier is expected to give the reader ussful or eeeentisl help in 
identifying the HEAD. DESCRIBE Information eay alee be ussful, but it is 
treated by the uriter as supplsMentary inforeetlon. A relation has the 
attribute RESTRICT uhen Modifiers are used to Mbit the reference of the head. 
*Der rote Pulll" ("the red SMsatsr"), for sweep Is, exhibit* this kind of 
relation, sines not all sweet art ' airia rliit'ihe*- the ad}ec tive has' been ' used as a 
distinguisher. '' : ''^^^^i^m^mt^ (*tho red fhm tntftfW*), oH the other 
hind, uould probably be * UESCRI8E relation, this ts bscauee red ta generally 
a property of fire engines, and so ths Modifier has been used i purely 
descriptively, rathor then as an attsapt to alrtgls wt a particular object. 
This distinction's reflected syntactically in Me English rsstrictlvs and 
non-restrictive clauses! 

(a) Cephalopoda that I iv* in coastal ersaa bul Id thslr housss out of 
stones. 

(b) Cephalopods, Hhich live In coaetsl srsas, build thsir housss out of 
StbnsS. ■."■■' 

In (•), the subordinate cIsom Ts used at a distinguieher, uhlie in <b) it 
gives •uppleeentarg descrlptivs irtfbrsatlon. No ts the t English requires 
coses* for lb* but not for (a), and the distinction Is often eaphaeized by 
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contrast ing l&aJLind efcis&. In Geraen* en *ne other ?hand t ths translation for 

both (a) and (h) Mould bet 

01* Kreken. die in oar KQstwuens isoen* *a«e« thre Uoh*»hih»« bus 

Steinen. - .-,.-. ^ bu.-.-^ 3 •...--■& . ■ 

or bat tar «• 

Oit in derJCostsnzene lebenden Krehs* seuen Into Uohahdhls eus Steirten. 
Sinca both typss of clausss hava tha sees surface reel igation, a aa ant ia 
knouledge auet ba uaad to aaks tha distinction. Hara mo hava a aituation 
uhere aaaantic interpretation i ■ nacaaaaey; lor Cataajk M ioatlah traaaCitia«, 
• lnc» if mo are to chaaaa tha capfacttEnaitawsraa r aa aa tation for ouch a 
ciause, Me auet interpret the Gereen sorrec**** 

Another claussiayeV feature i« ths registsr WeTS-ORDER. If one hod to 
classify this rsgisisr, it Mould bs ••• I gned to the l fereatM>n focus suetae, 
although tha IW*)-QR0ER rsgittsr ItssAf -Jm* vt*y id haa«aoaaara# Tha 
aaaantic coeponent checks ths saaantic cists of adverb i«i« int** -elme* and 
Marks INFO-ORDER accordingly. If ths advsrblal ordaring is tha dafault one* 
tha register is sst to UhWKED. If 4h» ofjdeeing is not the default one. than 
tha register is set t« ft a list of ths^'fof^tfiayadfaBtoiaiaf *a the order of 
their appearance in the sentence. Prtweebly. *hoao e ipea^is*c ; te^twe«*end-aef tha 
clauee~ore,HConatdscod'eeeJ^ Abates ie«eaaje 

other reason for the pj*dering. Ilkf ab^a^ :? aed4|la^) %a a«^ ? 1ihis a»»ht *a 
useful inforeation to prsaervs for ths generftor. 

Qne last theaatic. category uaad^pt the^iauaa is«el is ajUP-TYRE f iMhlch 
is a register sat on the RSS copp§|»^nd|^ 

hava the values CO^ANQ, byESTION, STAIEKNt, or efCOlBABY, It is asauaed 
that this inforMitonMi^be^aedhtMntu^^ thesdeduotive 

coaponent. Us would sxpsct ths rteouct Ue spos s nMi to hooM so— thh»g about 
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the implication* of the typo of clmiiUHd, i.e. the preeuppo*ltion* and 
expectations associated with it. For connected text, thl* sort of infortibtlon 
would priearMy be useful for dlusblguetlen, I.e. to Indicate the ways the 
inforeation in a clause could be used in deduction. For more Interactive ueee 
of language, the expectations aseociatsd with OnUSE-TYPE would rnoMedti the 
type of action that euet be f«rfa*a*eY''«^^^^ finding an 

answer, etc. 

4.3.4 Discourse Senantic Structure* 

Original Ig, I planned a separate d» scours* eeeentic structure (DSS) which 
was to be associated with each sentence and carry interest Ion about 
intereentential relationship*. Except for th* Inforeation eyifea treated 
above, however, ve*y imi* of this ***as to b« eerivrtl* free th* *urfec* 
structure of a text. 1 will defer s dltcussfon of rfJeeourw level 
structuring, then, to section it 7, 

4.1* The Place f*r€iee 
A cass graaaar in the style of FHteere aeea eiaantic cats ln«or»at»on 
for several purpose*. For »oee of t hew funct Tons, I have used other 
Mechanises in the *u*t*ev U>wn particular proposition* are required by. the 
verb, for exaapfe, the col tecaffeni tilt is Used fsictf ion 4.5 above). For 
objects of the verb in general » F hive fcgneiH^ceee entirely, eesuatng that 
thie inforeation would be ussd at the deductive tevs I. tfihogrdd's systee hae 
the ease-like global variable SHJC f toed tionh which is bouno* to a Ideation 
required bg the verb, ts in 'Put ths book nn ^ tabfe. fly systee. In 
contrast, uses only STONE, SnTW, st* SOTHTCE to *pecHo participant* ht a 
relation. This was done bsceut* wlection rsttrtctfon* filled th* role of 
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case herei SnLOC and tha othar glob* It that one could add, MKa tl»« and 
aanrwr, Hare redundant. ^ ,« 

Seaantic caaa doaa, however, have a place In the.tranelatjng system, 

.. ... ■■..,-;..} .«■;., . -, /',■■ '.. ■ : "* ■ - " .?;•' i ' ■' ■'.y-'* ^i^-*.®fc'"-j v^t i f •.©&*» sums-.' u.^fe--..^.. 

although in giving it one 1 have extended the meaning of the tera. For the 
rest of the sect ion, let us consider eodifying relatione exclusively. In tha 
eystee, eclection restrictions epecify the constraints «aodlfiar d I acae, on 
its head. In addition, we also need a i^ to^axproa^tbg^tj^a c^f modifiers 
that a head can take. For exaBple ?f He t? lght ^yo T aoja|i|v s tl 1 a| T avente f can 
be modified by loc^icfl, tlaf. and aannaf, or^t Jf^ffl ^Jectf^Jave 
ahape and qolor. Thess constraints are npt, noH s ja^laaa|ta^ u |j ; nc^, I fjfact 

thea to be eabedded In the (^ll^^ll^j.^eiV^ilWP^jJtSJ^ 
characterization of tha.in^lyiM^ao^l^ng^aJalJaiiJi ^ h|g| 4| Jeyel ., .. 
categories which I Hill caj I semantic caees^ e^^|||||a|f^l|H^^I^,.a^Mi v 
etc. 

Uhile the constraints on modifiers hays noi,baen,ia^|eaantad, seaantic 
case, information does hsvt othsr u»« In the suit**) i? |he^ ? ru|f i^fo^.ordjrjng 
modifying relations are expressed. In taj^.o|^paBf.^J^|J| u !|g.|j| graue^groeee 
Tintenf lech" ("the grog big saujd") soundf strange J^bofh ,Gpaan and Jn$l lah 
because the rule "size before color" has been violated. (Mote that ue May 
need acre generality than the staple caee categories to express all the 
ordering rules, but at leaet caee goes a long way toward expressing the acre 
common regularities that occur.) Caee also helps to explain verb modifier 
ordering. In Geraan ue would be acre likely to eay "Ui^ traffan una gaetern 
(tiae) in London (location)," while in English the acre frequent arra n gement 
Mould bet "Ue aet in London (location) yesterday (time)." I ehould add that 
right now the Geraan end of the systea does nothing mors with caee than find 
it. The seaantic component doss not care whether it sees "dti grosse gratia 
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I- 
Tintenf ieeh" or a "grati* grosse" one. These rules, of course, could be added 

eaei ly enough. S s a a n t 1 c eaee information does have a place in generation, 

however, einee eodifler ordering rules in the target language are expreeeed in 

teres of case. 

The aechenisa for retrieving ssaant tc caae Information is a eimple one. 

Since t am assuming that different concept ear-mere aiuaye imply different 

cases, the logical place for cess information la the aelection restriction 

tree. To but Id the s aa s h tlc representation for a relation concept Marker that 

acta at a modi Mar (including, of course* eY^-MtiittMftiv tie trace up the 

s«l set ion restriction tree until a Barker la found mVth it» CASE property set. 

Semantic casss are associated with subtrees of the aelection restriction tree, 

although there asy be s«v«rs I css«» • long • branch. This atloue ue to handle 

exceptions, since the fir»t case found Is ths one used. 



In this chapter mo have di scu ss ed the semantic representation and the way 
a eat of representation* are eeeocieted with individual se n tancea. The next 
chapter diecuesss seas of the issues that must be considered if we are to 
ehooaa a single Interpretation for a given sentence. * 
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Chaptor S -- The Rot* gl^^lilpr^^ifHHr^ , ; 

5.1 Introduction 
Tho procets of understandlno • Mntenfaraeg ba $fcougbi of oe tho procooo 

of relating it Jto on intarnal knowledge ftr^tunf. Anfto olroodu •witlon^l, 
tho translating tyatoo lapJeeented eakef no mojur^ toward unperatendlng, but 
instead bj-paaaes the prpbieji antlrf Jg t^th^no poooibU. owpoptlpn pf 
ooto^tion rj|jtrJctlon|)* XWa ojUMlon,4rlse*na* ^ ttf jihOjJipl^f. *hp% 

undorotona^^Ja^uniojp^tini tp tronj^ion^4i^ ;4 ^l^f^ji|^o^M.thj|. S4 ,^,... r 
convict ion that a fragoanUry o^^imi*no«opH^|ji»5jai#l^ . I#|J i# ,4b-0; - 
understanding. CQOpononJt £ ,,for ? tho 9\^^^j^^ ^Mi0^pt^l 4 \^ m^^^ 
needed to fill this gap. U not aa 1,1 l-OsHned -,^J\ jfiqi'ia^.^l^ej^^iojrjK by 
MinoKvi (27), Charniek B&? Gpi^ato^ (9J, fWkroott (2W, and S«aee*n (34) io 
oKtrooely exciting, ond con«tltut«» »ut»t«t^l proorooa toMprd j}/$W!ilt---° f 
repro«enting and »tructi^ino KnoH(l»dao. rv-';t..-^---- t.v 

The chapter Jthajt f o 1 1 out rjayas^heewj l^ j^ ^.idoaf, Jn ; |hi* rafjjreficea 
c.i tod above, but, I... wi 1 1 bo considering ihj^ 1^* ro»^r)ctod o^oatlpno. =,f.|rot, 
given tho sys too JhJit \* deacrJbad hora. 4|ha| ajr# of ( 4j)ty|'#ojtfe^j**e»f«t "• 
.expect between a KnouJod||i,aJrucJtuf^a^^i^t ejy^jye^^JSfccpnd,? I 
will take a short look at thp sorts of sj^j^jfaji^ej^ihati^ 

and some of the cue* Hf .can, tfjup ;9^\fa&fc)4p f^tftf 1 *^- i*-^*****"* 
to our "black box" aa the deduct I v* copponont, although ^ie.Jterji, io 
Misleading, Deduction, i* ; prob^ig,e%>lapa?^^ not 

nacesaar ij u tha Pf P*Wi «»chJhMo. &-nm ^*M-MRrdts*49eVPJtM^ r l natoadi *>f « 
"understanding," however, since the IMensptiono outiVnod bo two«n tho 
component and the systeo here eight not harden* IsOjM ,1m Mm 4***eca€MMone 
between a oore general ,"^Jnoaretandor , ! ■<*& .*#iaj|oo^.e9bJHl*lJUe^aMajM)iowN.' 
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In what follows, I will assume that the deductive coeponsnt it written In a 
language with at least the representational power of Conniver. 

5.2 The Basic Functions of the Deductive Coaponent 
UnderttamHng plays a crucial role art the interpretive snd of the 
translating proeefst we need to une)sr«.ti(i^ : 'tn' ; o>«ler to decide which sense of 
a word i* intended, to untangle pronoun re f eren ce s, and so on. Ue would 
t hers fore expect strong Interaction between the deductive coaponent and the 
parsing and eaaantfc coaponanta. As will be dlscuseed In tb# next chapter, we 
Right also need to draw on our general Knowledge 'atr^cture for generation - in 
particular, when paraphraee is necessary. This ssees to be a aore specialized 
k«chahf«a, but I am not prepared to dlscusa It furtftbr, "' so it ui 1 1 not be 
considered hsrs. fhie loaves, then, the Jntsrprstlvs rols of the deductive 
coiwjonent, which can be divided into two functions! disambiguation and 
supplying inforaation that is isplicit, but not s*pMclt, In tsxt. In tsras 
of the system here, these tasks can be rs for aula ted as choosing between 
possible semantic representations and filling in the slots loft open in them 
(the *UN9PBC merkerl. Theee processes ere mi Independent, but rather 
intiaately Interrelated. Clesrty, a cholcs bstwasn rsprsssntstlont is bade 
eaeler whan all the information is in. On ths other hand, us can sake a 
decision about implicit information only uhsti ms hsvs committed our salves' (at 
least teaporari ly) to a particular context. The situation is not as 
hopeiesely circular as ay preeehtafcien of itt f aarcly wieh to emphasize that 
impllci t inforaation can have its usee In dis*Bbfg«liWon, and disambiguation, 
in turn, wilt supply implicit inforaation. 

Ue cannot really ask how the deductive c o m po n ent wl II interact with the 
root of the eyetaa before we esk when it will do so. Ideally, of couree, the 



T=mm^m-- 



123 

understanding section of ths systss uould bs active, directing the parse. 
Since, however, tout InterpretetJon Is iW I «t the stage **eret syntactic 
Inforaation ie usually the best inferaatJen? svei labia, the question ie reel ly 
when the eeeantic coaponsnt should call deduction. A likely piece to eeke the 
call is at ths snd of sach seeentic specialist, so that had repreeentatione 
cen be el iainated right euey and delist rhava to be ear r led forward. In 
addition, if ali representattsjiear* rejected by de du c t ion, us have eoae good 
Inforeetion to send thopareer. Ihs eaoeerdeaXa^en^reeilree? the erv^, the 
teee coaplieated becKup Mill be, since ae are still st or near the scene of 
the difficulty. 

Understanding i*net, however, a; al^vaoiwl lth*c aroeaesc 8o«e 
linguistic structures require « deley 1f» parts of the*pMMBSSS¥ For sweep I e, 
where In Engl I eh we uould, soy* "Give ae ii» tfJl PJOili 104 UK blue, fine.," the 
Geraan could be, "Gib eir fefi CftlM tttfl *M WJbJiS IMlllll' <llte*ifTg. 
"Give ae Ibft red awe the blue pencil') . Hare, th# W*#t neuh< group in the 
conjoined structure cannot be feHg evaluate* #*Withb StebMfiae been par eed 
•nd evaluated. SiwHerly, a Geraan ehd*erder eleeee const met *^**ee*thet e 
good pert of the proceeeing of the cieoer oust usl t for the ss In verb to be 
parssd. Constreint nuaber one on our deduct ivs ceaponent, then, ie that it 
cannot be an ai I -or-nothlng process. Inforaat ion should be usable es it ie 
accumulated. The deductive coaponsnt eight be ^ abte? to reject eoae possible 
semantic represented on* for a given noun: group -■tmfkrw~tm'Tf9% of the 
sentence hee been evaluated, but a acre l1ke.l#''fune^'on ! ^i^ ;: !be to reshuffle 
the priorities of different possibi lit iss. This updating procsss Mill be 
diecussed further in ssction 5.4. Related to the use of partial inforaatlon, 
ue uould elso uent to activate information hot just for full graaaatlcel unite 
(e.g. noun group, c I suss, etc.), but alto at esrtsfh iaportant i n termed I e to 
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points, for exooplo, a* soon n a vorb U found. Thuo, the lopiOMntotion of 
■ the ■ doductivo caapeneat out! inod haro wouHd alee reoulre *oeo chengoo #h the 
oe n en tic opeciaiiato currently in t»w ma^a. 



5.3 fWiatlng a Santanc* to tha iteowHKlga Struotur* 
■-<•*?* It r« attuned the t the aadiiBi I v rraaeiwit «m bo activated by art:' 
appcoacr, lief lar to thai uood i« itinepred'o oyatoe. Bach abee4bfe eaaantlc 
representation el • son tone* ia convent od into a net «J eoeertlono for tha 
deduct Wo data bwe, and procedural aro eutoootlcalty bu»M? foea theae 
a>— rtion» to par for a tha nae a pw un de nt ha ) preeeMtng. Convoreion of the 
•eeentlc ro pro— nf t>on to on eottrtlon *»t i» atraleMforttarct. Tho variable 
in an OSS !• co«bli«od with tho concept oarhor and too ralat ton #18, for 
•xoapJe, .. (#1$ XI i^TOrVSh Each «k* ^ 

ae jm4 J * and an aaeertien nana can appear op r an argtrnt 4n other tap lee. For 
RSS!#, the roiet i an concept aerfcere era conHnad «Hh the var J abJaa of the i r 
arguoenU (if theaa are OSS' •) or tho aoiarUon naaee of the reJetJene foreed 
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froa ar ouasnts (If thsts ers RSS'e). FinaMy« aodifvinfl rsletions ere given 
•apart jone beted on concept Mrkvrt end Mserrtleet beat* on oase. Figure 5.1 
is an exeaple of the assertions that could be prpduced fet.a aodtfylng 
relation. For thf special ceaeof tb§ iw^lf^lnQ relation /WAVfErl'ROPERTY. m« 
build only one estertion, nepslu ont aedt up of the cat* Pi tha»pepper|y r tha 
var labia or astertion naee of the bead, a^,t^,ara|»a^ ) ty , aiCom^|j^l^,. 
Figure 5.2 alvee •" axaagJa of tfela, Th«ss assertion* do i»| t ra|i^aaafit al I 
the inforaation in tha aaaantic rairfaahl;^t|oj! K a^ s l$ If wppfeteo; that the 
procedures ul If use theaatic lp|oi"aftlop^« f ,t^a|| ? fo a^£ Jujr«^, |be 
assertion oat Into routines to b«UMd by tba <ifductiv« H po«^Jonwit. Just how 

thit uill be done, however, tetM to dsjie^d very auph on thf way general 
knouledge i« to be structured, so I Ml 1 1 not contWer tha qMaatlon 'M****** 
here. 




OSS 

CONCEPT-, JELEOONE 



A5t WIS X3 4ELBXME) 

A6t (COLOR X3 iPGRAVISH-YELLGM) 

'figure '5.2' " 1 

When tuples like those givsn abovs «rs asserted in the deductive dete 

base. It is expected that they Mill triogw all or «*• part of tne related 

Information thet It stored as psraahent 'knouledge. ''"Ifri '$e' •'iajpH* hierarchic 

■odel froa Chapter 3, this would Involve a chain reaction up through the 
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hierarchies, to thet (#IS XI #0ETfJPUS) Mould trigger the assertion (#1S XI 
MJATER-ANfflML), (*TS XI tft*f®HtWft1t) , (#fS XI JWHMAL), etc.. along with the 
related inforiat ton in eedh ease, float assertions Ui 1 1 be aori complicated 
than theae f IS tuples, and tie will went to be abte to record information about 
the status of an assertion, e.g. under what condition* an assertion can be 
expected to hold. Certain at nival distinctions are assent faf. First, we 
uould want to dietinguish between a relet Ion that lie know does not hold and 
one about which ue sieptg have no Inforeatlon. In addition, ue want to 
distinguish between a ralitlon that is unaessrtsd but could hold for a 
situation, and cms that is not only unasserted, but also irrelevant. For 
example, we can auk "Where did Harvey put the beak?* but not, "Where did 
Harvey put the rapidity?* Location, of coureeV ii relevant for concrete 
objects but undefined for abstract ones. Furthermore, we would also want to 
qualify assertions in the "yes" and "no r ststss, by the sources of the 
inforaation, the tiaes ths aaeertioni mtM In this stats (soaetieeo, often, 
Mondays only), etc. The ainiaal nuaber ol different assertion states, than, 
is fourt "yef (It holds), "no" (it does not HoM, and two verletes of 
"unasserted," either "defined" or "undefined." In general, It is probably a 
non-trivial task to deters ins whether or not a relation would make aenee if it 
were asserted, but certain broad distinctions can be aide. 

5.4 Positivs Selection Restrictions 
The use of a verb raises csrtain SKpsctatlons about ths nature of the 
participants involved. Siailarly, attributive adject lyes ra I se anpec tat lone 
about the noun aodified, and prepoait ions carry constraint* on their objects. 
The selection restrictions introduced in chapter 3 allowed us to express 
ainiaal conditions on participants in a relation, but there is eore 
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information that is potentially useful. For example, if we knot* one of the 
participants in a relation, ue often have a Much better idea of what the 
others might be. Let us call the restrictions from chapter 3 negative 
eelection restrictions, and define a positive selection restriction to be a 
block of information about the expected participants in a relation that can be 
used by the deductive component to decide between possible interpretationa of 
an utterance. Uhen we try to specify more closely what positive eelection 
reetrictions should look like, ue run into some issues uhich ue did not have 
to face uhen dealing uith their comparatively simple negative counterparts. 

As mentioned above, if all possible semantic representations for a phrase 
are rejected, the parse itself will be rejected. Since ue might expect 
positive restrictions to be rather intimately related to the general knouledge 
structure, ue could find ourselves in a situation uhere an incorrect statement 
by the author violates the restrictions and is sent back to be reparsed. Thie 
is obviously undesirable; the deductive component should distinguish betueen 
false statements and nonsensical ones, at least as much as possible. Note 
that nonsense will be considered a misparsing here, since I am assuming that 
uhen a nonsensical statement actually appears in text (ae in children's 
stories or a discussion of "colorless green ideas"), ue uill have been 
adequately uarned by context. An example of the potential confusion of 
nonsense uith misstatements uill probably be helpful here. 

Eingehende Untersuchungen Ober einen etuaigen Far bens inn der 

Cephalopoden sind sehr eruttnscht. 

Thorough investigations on the existence of a sense of color in 

cephalopods uould be very desirable. 
Let us assume that eruttnscht is defined by the concept marker MIISH-FOR (Ue 
might uant something more precise, but this uill do.), and that the first 
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argument here is the generalized APERSON Modified by MANY. To specify the 
second object, ue would give it the negative reetriction /JRB.ATI0N and treat 
the situation where an object appears in this place as the sort of el I ipeie 
discussed in section 4.7.3. Uhat e lee can be aaW about relatione that are 
wished for? One ieportant factor is that the relation does not now hold, or 
at least the speaker bell eve« that it does not noti hold. If, in fact, 
conclusive studies of cophalopod color perception did exist at the tie* this 
article uas written, the reader oust imun that the author did not know about 
then. Our knowledge of these hypothetical studies would not, however, block 
the friterpretat ton of the sentence altogether. 

It is harder to envision seae pes itive restrictions becoaing involved in 
aleitateaents 'than others. Our "wlth-for" exeeple is probably •" fairly coaswn 
candfdale for slistateaent. On the other hand. "tfce exchange in (II) is quite 
a bit tali likely to occur than the one in ifj,* 

(i) Harry is a bachelor. — No, ho got aarried Saturday. 

(it) Jana ii a bachelor. — No, shs't a feeale. 
A Mistake about Merits I status is prssuaablyeaaier to aake than one about the 
ook off m indlvllual. 

For positive ail action restrict lent, than, our knowledge structure should 
provida a Mechanise to evaluate how likely a speaker Is to aake a ei stake. 
This Mechanise could take the fora o? a value, function, or procedure 
associated wl th each reetriction to cs I cut ate the probata! I 1 ty that the 
restriction will be violated in the given context. Us sight not need 
completely explicit Inforest ion here about ths J fkel ihood of violation, since 
sone of it Might be deduclblo fron ths Knowledge structure! If knowledge In 
tho eystoM is arranged so that certain facts are ouen lass prone to ei stokes 
than others, then the systsn could aeeuae that ^ the wrfter Is also ouch laie 
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likely to aake aiesta*eae«te of this variety. £f .cjufffi ^.gM a would only 
mtvi a* a take-off point, a iocs ptf tlp#|||E v ^a|j|fj^ Q ||f 1 f r ,^«||fr.4^tj^ la 
writing) or other siMtetseents froe the sees soucae •^$ a f||ar t our 

oKpectetlprivabout tbMM«H^ 

In teres tad readers ars rsfsrrsd to I1c0ersotrtf§j nhlcb treat* queatlone 

olbaUef and.doybt in tbaaesjajjaj^ 

5.5 Filling in the Rlan^i ; I^ll%j(«i^|y!^PC 
U* have Mid Jhe tin •ddjtjon to Ofcjdjnft batMaapDoaiJbJa _ 

representations, the cJeducUve eeeppnap* .4^A^^|^9^R|jy ni .iril |tM> 
the* has been laft lap* iclt In the *e*t. , Ja .tta^ iapleeenteiion, It hee-been 
assueed that such implicit information coaat in . ii^yff J^tjf •# ^^RflAfn 
that Mill, be neceeaary Jn»gener*tipn f|»djn^af|a^l#^,thfi irijl.not. 
Therefore, a Jot of iapJicit tofpraMJ^KJM^P»| ^raf^ac^ f |B^|» ,ajaj«tlc 
.. representation at aJI. ,CoiHM,i»* tbaif>f||»laai 

« flarkur f log nach Athan. / l1ar||r^^aj^,H f: A|b||% £ , t> ;„._..■. 
; Unp^gJMIjp,fia^£aris, ,./ .^JfiajNe^^e^ j^j^f^., . 
Every object has an implicit t few. f«|t^La|p^ ajpa> JfkftilllMB^y u Wt= J^iJ^frFHTH-. h"*^ 
is the historical one, we would have good evidence jft&**^fB£jhjp/t "• ■»•*• 
the trip under hie own power, eince Mm f. : aUjsptfaaa hod .nojt yet been invented. 
On the other hand, alnca Mndberyh'A flight nostdat** the l,njenj 1; ion,of i;f $!?e 
airplane, i t would be highly unl IKaly $gjt thfi <M|lUir%cH-*?*< jn any 
other wag than in an airplane* «no$nj thajfc ffpj^w^ Jfd^lwge jn Ma feet 
and that Undbergh aanVthe flrelM-S^oF^l*. nonstop eeljt, flight would be 
even better information, of course, but .that la M&tfr* pelfit Kara.) Implicit 
noun tenae, than, seems to be usafu| ^InfOTHt^^^laialiip^ipn^ ;i 

Since lap licit noun tenpe sees* utsfuU ej^fd^^lep/bavi f.pJac^. In 
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the eeaantic reprejsan|a*ibn? am&l^&m* a* ^^i^riitB «f Cfl IWMla at 

.1 . ....... 

Berkeley has Identified the oust t ion o* l«pi Ic J t f jif jn-p** I on as «h iepdrtant 
on. for translation- In tfM fipTai|hEhdH.fh. tiKpM^|^||r, 1 hdVe hbt 
encountered any instancs* whSrs v^jftsjl ^Jee^P&J °* •"PMcHfy 
represented in the ,sesairt I c representation In ,, 

(There aay, of course, be "eeergeney ; s1ituiW^ 

find a we I 1-foraed i^aneJaMon and aioht need t* ••*• addition* f cai Is to the 
deductive component, but right not 1 *ly considering casss in Mhlch tha 

generator la tuesaailirf^lf^aybliM* ifea^ for a 

glgejL licfltl XJCOtfattP't* c«t|nW)«i«III^J rtltn wfil be 

rao***red far generation and which wHI d#, H^lli interesting to eae 
whether thie aseesptien of tha predicts*! lily of explicit inforaattan fa 
correct. 1 •houtd note that because of thlf aiKwapilon, the seaentic 
representetfon »ft:th#aya*aa is \KfigbJltf "'lawgua|prbafahaahtr that lev aajSindent 
o» tha two ^angMaoaa involved. Me alght expect * systae ulth language* laaa 
eloeaty related -«|^tanb>iia^E^1lib'%a' \tg0^^^fi^l^yMli^^t\m In 
a eeaantic representation and to differ contldarably frx* fba 'Kaaairt auate* 
In I ta repre»entetJen-bul idi ng behavior, 

Ua can aey, then, that the deductive coapooaftt will be supplying lapllcit 
Inforaatlbn both far it* bun purpb*** tttil to ***Mo the*"' •eeantic 
*epreeenta*Vbh for the generator, 'lit be ^cii^fJltti oh' J the Iflfbraitlorr that 
m1 1 1 clearly be necseeary for generation and oak hou ir can be •uppl led. tn 
ea^t>an 4.^;^lw^eeuabed' th* use bt\'lillSPiC'^oT'%^nga' 'I Ifca t Nia*, 
location*, .epanjta.,.<'e^.. ; ttith «;e|»chan1*a' *uW** ^fHk br opoeed by ninety 
(27) , thl b eort of 1 nf or eat I an woutd be suppl iad by a rich default structure. 
Of course, a default need not be coap lately specified by the internet 
etructure. For exaaple, Me aight ehobse a general default for the location of 
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an object to be, "If there la no reaeon te thl** ot*ian4ee and the location of 

the object hat boon epeelf led jreyjeuajy, aee^«Ui}|e M! U In the.eaae; 

place." It eceoa fiir to iihw thit dsfaults Mill wauellu b« heeyJIy context 

depondont. 

In chapter k ho euggeated the un of ,.(HJ||BpE^..<or pIMpM* eltuatlone 

where a ro lot I on uat i Op I led by I to ergueenta. Thoro ooiook |« bo eevejsal 

poMibi I itiea here, Tho *4i?et variety JlppeeraJn our eaep(o paragraphi 

Ob dlo Ctpbolopodon Mlb»t oyf Fat;ben p^iojpfl. r Jo|^|c^ bo>owit. f , oo 
acheint doa zuo athdeeten for ditto Foroon Jit tlnta pTfawliw zu 
. .. sprechen. ,_ wex/^^:::- 

Whether the cophalopods oven t^fcfc^ur<rt$B^-.*nmm**» ** **«•* for 
theee epecfee, tnia'OpboorO to tupppr t a. f£J£L MDAsV 
Tho full relation has been preae/tfed directly^ ^ be lae^Jetely deduced 

fro* tho context, and tho ergueeo* of tho^olotiooalfltoteo »ato* aa an 

abbrovlotlon. In tho e*aapls, mo no»|dr OKpoo* iho defective eoejwnent to use 

information about reacting to iootdlatsly reformulate tho first sentence oo a 

question of whether ceehaloaode have a ssnee?ef -ooJor* Latop.v^hoo we eee tho 

reference to a color twin, hi con chock context and etajply tho loplicit 

relation(s)t 

WSUPPQRT 
A18 
(lamVE-faCULTY HOSmUdPm «WJRdB»o3EH 

Hore, A10 rofora to another •••ortion. Thee, Hltbieaed.uoderetendlng of 4«hat 

is happening in tho toxt, this aort of allipaia con bo handled In a 

etraight forward manner. 

For implied rot at lone that are not leeedUtsly supplied by context, the 

system Mill need a oore generol > mechanic*, for finding o typical relet ion given 

one or nore of i to srgueontt. Mo Hill eee *hJ* aoohonNpOpad again for 

coepound nouns bolpw. I euepect that thoej»ffiOB^ ncntemtM*l liapHod relatione 

uill be draun froo o rather reatrlctad«^ro«|i, *|lth#ol»|fe|ao like ^EXISTENCE- 
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OF heading the .IIMi .but .thip specuiattan U opt based en extensive analysis. 
Note that soap lap Had relatione are d*r » uneaJMguouvly free eurfiee 
atructMP« Jf^ ; a»«ii»>le, tha Enylieh »ff ; poae1&tB* ? la e^ufvatant to "If it^ ia 
possible"). JMa wouid expect to Undaort variety »n aeeantic representations 
for these cjeae, but they mm not Rel*v»ftt right neu, ft^co, theu would not be 
■ar.kad.iiittj..«#^P^. . ,__- o 

Finally, let jia eonaidar a use pf the IUMSPEC earlier that was mentioned 
but not dVacuaaad \p chapter 4, . |»Ui»an^^aj|^ii«| that are finked by 
coapletely .J^ioaj^^aiM.ralaiioni and^tpaajk^ 

relations <#HAVE4WERTY, aje&jftg*, 1^^^^^ Ja ^^.oup of 
compound, that can be handled mHH thi JWISP6C aarksfi Consider the example: 
. dls Hau.tOpsohlflawl - dl^;Sehlj||^^ 

<i0 « r ' *•¥ :jrt*Hfct***»t •* ebieet ;of-aat,tah lapHad 
• hare, affnen, to open) 
Hare, ho went to find a relatjen thftt la thp tunptlan Of the key, i.e. IJVOPEN 
#KEY dOOOR). There are a whpte •$r\vt ef theae functional compounde, where 
the relat ion 4 Jafjt 4a^!unaej«sl f ied. ...M>W*A<f&&to* ,,^Jad««2ll^:^oa#o>lant 
Hi 1 1 have to suppy a typical relation when given. Ita a^gumente. Nate that for 
the AFUNCTION-QF cape wp are no* asking far a whole nap" aechan lee. since us 
Mould expect * Knowledge •trueture to h*ve« trctfK) tlfes between an object and 
its function. ■ ■ . r .^ {> -' ' .' e ■ 

Compounds do exhibit relationa bstlds« /(FlICTIOfl^F that us eight want to 
represent by MJNSPEC. ^er-ippjaajia,-.,;.. ; ; .- ; 
dor Handkoff er • hand luaaapa 

(luggage that can b« carried by hand) 
der Kablnankaffer « ctcaeer trunkal tore My, cabin trunk) 
( I uocags that can bs used in a ship's cabin) 
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For the«e and other wore •••ocj*tio»«4 c<«^JMi»d«, it if no^c^»«r that there 
i» only one relation Jhat can dsscrlbs ths connection between the c<mpound's 
coaponont s. It sseas deslrabla, houovor, for the deduct 1 vf ,, ca npa n snt to 
supply a likoly Pfl*Mpn,or relatione to ths teeenMc fepra»entetion. Than, 
if thore is no sou ivs lent coepound or c4ass if Jer piu* noun coabinst ion in 
English, tho gonorator can use. the sonant lc -: rfajpaaatajinja to, produce a clause 
or paronthesized oxplanation to doot^lbo tho obj«c,tv . 

5,6 r O^har Contr,lbvti«ns oi the Qsdi^Hv* J^aponant 
, Except indenting w^th.thi.JUlB^^ la not 

•xpected to alttr tbs teaentic rsprsssnjstipn. |s»rCi^|^Pjacl^lt. hpMayin?* to . 
sot regiaters on assent Ic nodes MJtfclnforaat ion that mIU be useful for 
generation* In particular, thif A*tmm*kl*!m£&£m&l!d* categories like 
Ha 1 1 i day* a known-unknown and var i apls-valuo (131 . F I neiehe ices between tho 
categories rsstrict-deeoMbe and psasrlc-peMlcMler ^uouAd eiso be pade by tha 
deductive coaponent, as w» II aa- docis ieao acoittt.coi>a^arfnca% - na- tho/ a o nar ator 
becoaes nora sophisticated, ,thar%MJii;;no- i dOMt4. ; JN»>»'^ 
would went to add to this list. »: 

5.7 Deduct ivs Processing, above tha SeMence Level 
I n . order to . dac i 6s bstwsoA Boaantic raprsaOTUtioa^, tha d«*Mctivo 
coaponont will have to conetruct ..* ,PP#l-: pf **>• ,.1»^-?ati|Nt; Inloraa^loo*:, 
above tha sentsnce lavsl is ci^rsntl^ liK^rpo^ad in|p J^eeanntlc, 
representation, -sines it i* not get clear he*# s^K^J«4^a»^laji,can. ; ^Msad In 
text generation. Nevertheless, I think it is UerthaJiMa^Afi^flHfe^jQPl^** 
some structural aspects of our sample paragraph. ^nj^wejcan^eeei;frpa tho 
English translation on page 15, the sms^ J aparegrsph can Pa divide Into six 
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section*. ■fl'*iaa?d^a*eiia> : -«s^ 

1. An upward patntert raleUng whet la to coat to what his coae before 
... In the text "'* . 

. . 2*/Jlart?ft#a?e*ft#a4 «M^f^-|^iif L, ^'iiili|Mp^ 

A. Uhat rwiM Hi ban to activate throechanier „ 

5. Uhat goee with itt aeetu^arnr'ftg aeHona 

6. laeuei can <»pbalopad* parcel^ color? 

Im, ,\hi-m^ar^^l^^4^m.him- l : **v.JiiFd «ti(iir^^«^il^i|^ ? ^iMiS4arMi'.-«i0u I «!-■ . «Mf»«c t thea 
to be bandied usine? aaeei aJ Mnoirtedoe o* ^ way ^ I bo. a 

prooaae or aocbanlea* The sixth point -ha#* lawfct dlraiUy >a I ated to the 
rert, and we would not OKpect the -precieeweTiaiw^ptloi* handler to ooat with it 
dtreetly. The leaweef color perception I* hot. h ow ever, unra»i*a# to tno 
; o*he*^*4¥e*aac*iai^A;.fbe^b^ 

producer*. Ara t hay a >>e ca< or coni uMr ■().»> par a il var a) ^ Tha ><«th : 
• eee**on^: therefore* *n*ett i getas the Inverse of e>4e&»f$l*ti$ftH In thit 1; 
■ paragraphia - Tha^aaaaelationar « It* aatwaen aactfon al»< and tha raat of *h* 
paragraph is a fairly coaaon pheneasnon. « Siai .far aeeoBtat) anal I Inks eight be 
ueed to introduce Motor lea J inforaatJon or to aaha points that ara too abort 
to war it a naM pa p apra p b of tnalr ow* r^ can conclude .. 

that «hW# paragraph* '-a** ■■f0H^\-^0f0^T^mim^^1iit¥^9 topic, wa cannot 
ewpact 1 ' 'a etrl«t*y top-down, -©ne*top1*-i»eH^agri# organisation. One good 
heuristic aaaas to be to looK for Maoolatlonar link* nMr tba and of a Une 
bf: , deter'ipt fiiKor-weaaoBing-, fea^ara the a^ aaea«nta aliova^ Ha^ld be conaidered 
Hna* of daaeripUento 

Any attaapt to bsrt Id a aodet of the eaapla paragraph would also have to 
recreate the reasoning ueed. One of thS'pattarnw'iw*-*ea In tn* paragraph^ i at 
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(i) Generalization* X is I mum of Y. 

(li) Evidences Exawpl* where X it blocked and a change in Y is observed. 
This sort of reasoning pattsrn would be associstsd Hith causal Ity, •"f^J,* 
could ba cuad lexically by "varanlaaaandarr" ("causing" line 9 of tha Gereen 
text on page 13), "bei" ("upon," hara • causal re 1st ion, Mn«.U). "rtMngan 
von" ("depend on" Una 13), and "sins UlrKung auaflban" ("exert en effect" lina 

... . , ...-. r .-.',.-. ,-. „,.. - ■; » : s ■*:■,;< ■:?■* ■■-',' *i>-7'} * «T' *t! .. #? *»$** »n*d5 .ap^K'l"** 

19). A variant of this rsasoning pattsrn - uhara evidence pracadas 
generalization - eight bs slightly eoro difficult to handle. It is obviously 
easier to interpret evidence if we Know what it is evidence of. It eeeae 
clear that at the paragraph level, too, we uill want tha deductive co opo n o n t 
to handle partial inforeation at it is accusal ated. 

5.8 Metaphor ic Language 
Most typos of text that uould ba considered for Mechanics I translation 
probably would not contain phrases I i^^ WU&^&fTf***' "^flfc 1 !* 
store," so one eight cone I ufett^ ttsj^lty jo, h^ls^tapl^ic I sewage 
is not »»Portant in a laical s^stsa^, Th^U i|ouJg Jja^ yPlU**^-.,* 
unf or tunate cone I us i on. HetapN^.J s T^^^j^f^ \%&> £?&, f>M *$%■**** 
general problee of tha creative use of lengujge A J^mM^m?' * tal ^ '$* 
definitions of ths individual w^yjQ^ a^ s^p^ jp^ of ^Tp^lm. 
whi le transforaing others al.igj^l||. ; *j$g^Jnjf%$fy#fo& \&& l *&- ft 
deductive component ui II have to reason by nfjypHjhi^ll} ^^^ tb ft 5 WM* 
one. Often, ue sea a, phwioeenon , j^aj Jar ^^paja^ ^JFt.^frs Cpn»«<^. 

for exaaple, the W*$WNNJ!M f* W^^^fl^^W;^^*^ ,n 
a technical sense, as ,lt;is _usad.|^^, ; ^^^ *,.., 

excitation, or iepulM.^ifien^ %$mmm**™ 

agi tation, whi le in a ^ socjal si ^ t^ion iica^ mRli&toM&ha ^ ■^ h A*^ M,, » 
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this by giving three #if fersnt (hrf ini tlew to ths word Err ■ouna . felt thsn us 
risk sissfag th< iiwm ground that s*rlsts bstussn these thrss ssnses of ths 
word. A knowledge structure should 1m sbl» to represent ths fact that tha 
thrss ssnsss srs snalegouet there i» • «H«turbsnc« of • atata of rest by a 
pnsnoaswon : ti*et is wnstabts or Msre i te tabto In saoa ■snsa tletapbor ic 
language, then, asses to diffsr free other sorts of lanauays use In dagraa 
rather then jn quality. 

One pessife** difference between sstaphsr end noma 1 Isngusgs uss is that 
setaphor is unpredictable. Unsn s new astsphor is sncs w n tsrsd, wo presumably 
have to call in oar analogy processor to find 4ns points of siailarity and the 
points that are t***ri*eant.~ Once tha senses' ; of a word *rs known, however , It 
is not clear that the e*ossnees of tha slai leHtief ts as ieportant angsore. 
tlhars could 'he ths usual tosnef i U ©f a shsrstl eodel - sconosy of storage, 
uniforsi ty of f e|n s ssn tation, etc. - but 1 * 1s rat doer that these issues are 
relevant.* fh*e conirwt of sstaohor snd regulsr "language Ignores the point 
that language usaos hM to so learned, WW ue sneountsr ths ward Erreaunn in 
a technics I sense *er the 'first t*aa, «• are' •Iready prepsrsd with e sst of 
possibil I ties that can be evaluated to dstsretas which are relevant to tho now 
usage. To hsndi* **th eetaphor ic language and ether sera coaaon sorts of 
Isnousgs use, than, 4 deductive cospensnt Will need to be able to inspect and 
altar uord «Wi»WieW lor seas sods J tit thee) '/ and l»f) I need I*' ths genera I ' 
ability to rsason by analogy. ■-'■-■;-^.. 

% rotated issus hsrs I • that of "dead estephor." Sous cliches and 
itfloeatie sMpressrans hdva tost the frsshnass ? bf ihslr analogies, and ons' 
sight conclude that the origins! ssnsss are no longer relevant. Soss English 
sxssphss s+gtrt bs •That takes tkst cake!* and ««e uaa bow led ovef.* 1 have no 
Idea of tha original ssnsss of thsss, although It would probably be 
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interesting to f ir$ out. Ths point hsrs ii that mo can stl 1 1 um euch phrases 
•van whan thejr prtginf | 5paj^J.fOj^ara,MD|noj«p :l O£ t Jo§fcJggp||tafu, For *"••• 
aorta of dead metaphors, a systss should prooably book *** phrases directly to 

example would have rcM^i,!^ t|i9,-«9!!P» iP^PEyifP^lt^lll^llMfylA 4 *?^^! 1 !* "*-. l| P , *-^ h- 
meaning of the ■ ^e^jn^\9 .^l^^tf^p/i^^f ajaa ,fa^J$Mltf astounded." 

It i» not always M»y, however, to t»IJwhao J Ht«P*»P<' i*«*aad, and I 
think we should be cerfful not to throw away top auch» CpfldfB the elfcuation 
of prepositions. For tha purposes of c^Ja^ |, t^,jaa|| £p^ aaBoc|aUiig a 
prepoei t ion ui th a verb or m\ .e|.|ectJYj» wiiygff $ i <MMat' iWte 1 1 M ona 

!°** 8 „- d ^!P • n °M0^ ,.. h0M IW'.i : **•£• !• Sf^ t^^PfM'Bf 5MW n 7f or < -.^t«* 0,c ^« 
Consider the example ; "ebhingsn von. "_ Ua , i|||i||f n .tr«!if 1 1 Jtf %l |b i |^a ,," djappid _,on, " 

but tha literal weaning if .-"hang, .down ^SH*": ; ., ^af f • ^^d^Ra^dJfnga) 1 1 • ; f orj|M..I.«ta)Cl 
in terme of ■ phytical aiti«tlon, . Jf^bjfp ^.fr^i « andj^en^aasy^lio^e Y, 
thia will hava an effect on X. Nots that ihj„|pjl|,a|^"dao^ s ,oii* usee • 
a i a i 1 1 ar phys i ca I ana I ogy, Jaffa \ % J* • s «Mgb§l MU$ B SQB&-£i a ** P*% °f .^W 1 * * « 
If X i9 Bet on Y, then l aotion in Y will *f|»fit.&49 well. ....,> n)p.l«lMt. 
anc^ator of djya^jl * djfifindJCft - •bare* tha baog-doim-f roe analogy, but that 
is irrelevant here.) One could argue that tbe^pal^afie^ 
dead Metaphors, and that hanging things doun and piling things up have no real 
connection to our thoughts about dependence. The aore closely one studies 
prepositional uss in both Caraan and English, houavar, the aore apatial 
analogies can be found. Thia situation seeas to be especially interesting in 
the I ight of flinsky's suggestion that a single Mechanise could account for 
both visual and conceptual processing (27). Prepositions ssem to be one aore 
example of the intimate relationship between visual and conceptual processes. 



§kf< ' ty*fNM»taa :tt# 'festive Coapor»nt 
The ie p leeofHeitl-on uwi «n etttreeety •lapta WQghaMeo to by-paoe 
understanding, fct If deecrlhad ham for ooaptateneee only, eince it «HI be 
obviove that Mtch • aeheM uoutd b« lapr«ct!c«t in • H»rktng »y*t«a. The 
preyraa reouirea IMA Hie ueer undiroMnd the t**t and eupp-iy **»» tnforpat ion 
that would ordi-nert ly be euppt led by a defective ccienint. By the tiM al I 
tfcle Interaction*** taken place, the after eoutd heve tane-cince translated 
the towt byhahttV Jhwerthetou, the deductive by P— routine doee allow u« 
to get an Idee of hou the-ayeftea Mattd behave to n uere aire eeaprete. 

After 'the M ea nt te coeponent has p r od uced the poeelbte roprocontationo 
the user \* aehed to ddetdn which eeareee»tet)dn'he would like to eend to the 
generator <by typing k« nuober), or Nhather he ueutd I Ike to eee the 
generator try *ta hand at all the peaatfet* representation* in euoeoeeion (by 
retaaondlny ntU; #er etch r«pre**tt*t Ion that la to be generated, the ueer 
i-a then aafced to supply concept aarteere for the node* with WMSPEC. Final ly. 
aodee for noun prouee MrJt«d ClWt e^ f»/MttIC8tJII Ore pi^ew»ted, and the ueer 
te wh«i to •pectty other nodes that ^ ccreferant. thd other inloraatfon 
Mentioned In eeetteh 5*6 \» not reoueatad rtght nou, olncd the generetor ie 
«0* ''*>hew*#iad enough to uee it. 
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Chapter 6 — 6«ner»*lon 

6.1 TheProcM* of Surface Generation 

,6.1.1' Input to the Generator 

H*v|ng completed •yntictU ! #Hl •••antic an* luaU of an input sentence, mo 

,are now #t the point Mher« generation of an^ngHah een-tence can bag) n. The 
firet question i« Juet what th« turfac« geoaritor .^rtpMlW^l^w l» 4H AlipJ^t, 
and In general the answer to this it ■ ^p.%,-^1 |A|^iKtfcr«« ADV^ tfo wot* *W»* the 
•••antic representation that hat M}fWto?*l^&:JMtP4fBt!V **• WWr» t,c 
representation, every «f fort h*« sads to Include jffrWi^tiMlaaspitla* •*-. - 
possible, with th« hop* th«t thl. wogld !»• »uff Ic^njt ««r thf gwi^a^Vng 
procsss. as ul II be dlwussed bslw, ths • o a snt lc repcasontaUoo U In fact 
not adpquajta for every eventuality, but it atMJ ©onstltwtsi the M log Input 
to the generator. ,,. # - 

One could* auaaMap^Hhathar Jthfj >aa f pt tc fi rappaaa|»|at|pB i^lt, JWaj proper 
Input far generation. For sx*spl«, ubsn ^enslajAag Mri|tsf Caraan Jjitp 
English, I firtd pjsalf using, syndic ouldancs. ,, ©na locf #t • prenaalnal 

,clauss liks "di« untar der Haul liapandan,^a4la»" JM4fT*i4M» "*** HP#P thm 
•Kin lying csl Is") and Velative cjaus*; or !tha|^ coast to aind. Tb I a Mae of 
I anguage-dspsndent translating rg I •• or heuristics say be a per sons I 
idiosyneracy. or iiP*y» Infant, be ona of *hs sborUwt* that paapla oftan 
use when translating. At any rate, rulaa depsnjMng ^j^^il($iw,0 .,*****• 
to target language uars not ueed In the trei^fjng^sysjsav. There seeeed to 

be no caw where this ust oscsssary, ftlnca il i il^h9H* l 'IB^W^^^i^ f ^ uim * 9 
semantic rule* correeponding to langiiapa idaptnrient syntactic onss. 
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.6.1.2 Ceaoerino. Goaeratora <wd Int w or et ar » : 

The eurfaee eaaeretor um *cttten in en oMtendod *er«lon of PRQGRAfflAR, 
and In aeny M*yr^W«§»H£r »o tn« IrtU i ' i Mtfti 1 . Sinoo the Interpreter and 
generator were toui J t for two different watwef * Jaiajfti^diV m would not expect 
the* la *»• Idanttaaj «ft«"rtK»ifer"Hiirl»f^ «** 

even kf onfff a****** (wi gMoy i aara t wot »arf fc ar o , w» wo»W not expect the 
aenerator to bd\a^aa^or*N*a^ki#ei*ad*«^ Thte la aeciuee 

there are different *no*M and w*noun« fer^tHi two a ro ai lM* . to the end, 
■"aiaili . the i#* tatpftftoa* and tno -oanareler *w ft - hala ^ajM-liiMftlfatJa cholcee, /etace 
bo«t wlH bo ii»Jng >4noul >tfc 4nforo»t»on >a»ad an a^awatoatc Chorootor Izot ton 
of tha teneuaooa Vnvaived. Bat m* #an*nt fM«la^lioW«iM ( r ae evaWebl* at 
dl f f oront t law far tho two arooaoMo. Aft I nterpreter bay haoa to Way^ 
aavarai praoaaaf«a>^a4aaa oatH a ieitfhaabliWlg;^^ 

alternatively, U ooy try ono of the peeeibliitle* endbeehapK VI generator, 
on the othor hand, -dee* not noci tte H tg nava to oonaarn Haol f wl th tho ieeue 
of aeblgulty at ail. tinea ett of tho inferee* ion H need* I* available In 
unaaMgviowe fari fnoa the oeaentlc reereaairtetlen. Were, I eo uolng fecal 
•nbtgulty to aeon aahteulty that alH a* raa%fva#^iW *taa a 'perear-hae 
f Inlehed syntactic anolyolt of a wnt o neaj eaa'4i)'M 'tta4HfV aMt«a»; af»l la 
the cheieee"-eaaar***-aa*e^ tha relative 

; I epor tonce ef the''' Oho Joe* m\\ I dtf ferv Ihf 1 folaraalFter f «•§* - 1 ta'^ittf a&Qa of 
greaeet Icat ' r odu n dancy and, In 1 ts hotter ooaante, ■iiJjhW'HIielfhoatf 3 **' 
dec I do uhlch choice woo eed* by tho author of awj#iiraii^ on 

tho ether ''fuaaft ^ba*a enactor ixet I on d? th* aeon* ng lht*hfieY- u: ft etiet^eake 
ehoieea ta coeeunlcete He o a t oaj * iff t em mty andr with iuefc. graceftiny and ^ 
unoabiguottely. Mo can therefore expect the ttw ereeeeeee to differ In 
rotative tlelng and eapheoio. 
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B.1.3 Translation «rt,Geiieri J ^H^P^ JS^P^^l 

Several tiaes a I ready J ,,^^f^^J%J^II§^^ ( }%.fW^^-^ 9 
;surfaca;.|anaratpr. and I aheu|d pej^cH^fy gp|J % a#en. %*"»•? ,£ 
particular, how doas the generation j^ptN|i ^avwlailf^iti^ *^ l f^ ^SS?!?!?^^ ^ n? 
general process of mtI ting te*t pone's oia| Jar||i||i? Th| t||f| deference ia 
that thej^nerator In tha translating suetea Is start Irwfroa a highly 
specified ssaentic rsprstsntet Ion .which Is In aost caaaa JI Mill dleeuee the 
•xcsptiont later on) unchanged by .the J?J£<pf # '- , j^^* 1 * ,• lot of "daap* 
organization has alrsady baan dorp by |ha tiee such a MMnVi ; rapratantat I on 
can bo produced. To writs the original paragraph,. for #*aepU. tha 
information had Jo ba asaaablad f*f|,'J ? J< ^^itE^^Sr?^^' •. --. 
Patterns of raa^rf ^and^arj^ant ,ha| f>JfylQg^^^4^ J&&,,*" 
rolative iaportence of different inf«t»o^ had 1*1^ ••^•'^•W !,;«*•• 
steps are not neceeserily IndaDondsnt. but eau be IU*fd in rather coejp I icated 
uays. The surface generator in the translating ayetae does not % In aos^ 
casss, havs to consider these choices, since they are already specified In tha 
semantic representation. In this seme, then, ganaretlon for trenelation ie 
easier than its ewe genera I -purpose counterpart. 

I should nots thet in character Iting generation for trenelation as 
genera I -purpose generetion einus soae "deep" stepe, I do not uieh to suggest 
that the semantic reprteentation used hare necstsarllu repreeenje an 
intemediate level In the general generation process. In fact, 1 suspect that 
in a genera I -purposs generator, «e mou I d^ not MM|,ta create euch a highly 
organized seeantic re^ar^aa^taiti on^ titjil |ai^ s^a^s^a||j^ ^a^rj top. theJeKlca^and 
graaaatical level of the target language. It ueujW npt surprise ae, than, to 

gsnerstor end the^generator dascr Jbed hsra, ijK ^^ _,_ _ ierf4 , 
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In addition f» :; these ditferendisV «*' cart day that if generation for 
trans I at 1 on «**> lid" odrilWdarad'' sarier then gsnardM-purpdds generat ***** in 
another May it le hdrder. The gemratbr does not have :# dac Ida Hhdt to Bay, 
but the other half of the coin is that it has to tafiar Mhat ia generated to 
the intent d# the ; drf# nd? tan*. nW'tl&e tddW'aJr' couras, uas written In a 
(Afferent lenguagd. If we can say that tanguagas ara organ izod to convey 
eeeniny, up can a4ao say fhdt the organid^ is 

influenced by tha facilities aval Idbia in a given language* Word I choice ia an 
obvious exaapie hard - thera la oftsn not an ai«acriy eddivaiwit word in the 
target language. But the prdtolse actudHv aanifset* itself at si I Isvsls of 
I inguittlc organisation. ' When a eleaileh' between ' languages occurs, ua often 
have- to eodpreirta*v suspending one goal to aehieve andtne r. It Is true that 
eiaiiar coaproaieat duet aleo be dads in general - purp os e generat idh, 
especially in a situation where one !• parfiedterty cdnoerned about style, 
ffhd worst case hare is translating poatry.) %* need for such coaproalse ia 
Much dore fred^aht in translation, how a vsr , and occur* even tn cased where 
style ie relatively uniabortant. 

6.2 the fngiish l^aaaar 
6.2.1 The Baelc Shape of the Generating Graaaer 

The Engl leh generator has three aain parts. There Is a group of Engl I ah 
syntactic specialists for clauses, noun groups, prepeeition groupa, and 
adjective groupa. In addition, a set of routines m lata to buitd, aaintain, 
and inspect a "generation tree," which recorda the progress of the generating 
process to date. Finally, there 'trt definition program associated with the 
concept workers. Only tuo standard definition prograas mr* used, but mo shall 
•— that they of far a great daa) of latitude In tha ford a definition day 



^^m^,^^^^0^f^^^^sm^g^l^mmw*^^*^ f**-"**'-*^*'*-. -'•«»*«•..«'* %&?"• 
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take. The generator translates, a ..« i«*i* seo^sfa^fkl „a>^a%.,:aJtha>gh" there »• 
nothing to stop it free using^ ij^ofe^lo^i^^^^l^iientfifl :Of,a 5 e|MSta«^|,. or 
froa inspecting the taut thst has slrsadyhee^oeaaraiedv 

Before going on, I thou Id «ake • feu rseerks on the ecope^a#.;tha- 
generator. The systsa hat only a feu English dl*tie*ary definitions at thia 
point, but the routine* for writing definitions are quite general. The 
English syntactic specialists are net a« e*ta«*iva a* Jtbeir Geraen 
interpretive counterparts. The reason for this was basically ea* of tiaa, and 
extending the breadth of the generating routines would be a stratghttbrMard 
process. This is clsar first beceues the SKJetlag pwasjsaao canr gonOrsts 
moderately conpiex sentences: subordinate clauses, conjoined atrueftafes, and 
rankshifted noun group*. Second, the asior gap* in the kngH sh generator . 
aost notably that it does not deal with question* or cosaends* can *k» plugged 
uith code that mil look very ouch like ths decisra&v* cod* that has bean 
mt it ten. Declarative, interrogative, and 1 soars tjy paths tbrouajht the 
generator should share a lot of cop e on code, aefetboy da i»>*ha#ai?aan 
i ntarprfrt iva greaaar, In- cehlraetr *© swtsnrijjig the breatKh oi 4hB, generator , • 
extending its depth, J, a, givlngCit tha i*^|^iosa)a|«sis»Fai4iifoi!Bedfc 
choi ess. is of course a aore dtff -iouiMiNi^s » -W»f eyn tactic types that were 
implemented, however, pose mw& ; qw*Mwki69r»*>9W~U aJajoa>-Jn language 
there are no truly "simple" exaaples. Finally, the routines for generation 
tree construction are fully implemented, aML,thsy ars the sup ject ef the next 
section. 



6,2.2 Additione to PROGRAmAR 

The first step in building th* English component was #» aaka tuo. additions 
to FflOMArf1AR f A mechanise was necessary ier but iding and maintaining a 
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generation tree* and specie! fenetiowe ears held a rf tft apodf g ihc nodes to be 
added to the trees. - ; 'fNr4lret- : 1l«dJr ; «»^Mr-^|«tl«i4t^ i M^e one - generation 

tree nodes are 4iM&V&itaiWlW-^WNit&& ihfiieiMohr ^ 
FEATURES 

- The *|p>tecfj« feetouree of «he<*#t - 
fl@DT 

If th* rode i« e terntnai one, the res* of the-sdrd-- 

translated 

• Tha phraae epast ate J for the unit 
DA1BMTER5 

Tr* daughter* of the nod* In ravorM order • 

^ f ft*? eseant ie node far which the unfc* itaa generated 
The inforae*lon e* e neeeeag as reed «sing t^ and 

STI^e with the node es ereuwswt. More, the "f»- stands fer EngUeh end is usod 
to neintoln th» di«t tnct I on osttwen thle* generation tree end the Cere** pars* 
tree. Global var io bl o s anolooow m 1bw$>iH&ni00t&. 1 *^thii1*WMmt* 
&4, - t**E, *«#*, ,. ttim the eavJoea eeeoinea. The aasaaaa ^er 1 ab >e aeeheni aa 1 a 
theaeae, and •©> ulth »o»s atkh^ cothn ar« *h» aettie) fur^lor* POP and 
PQPTO. 

' A node- i# added to tho oer^atVon tr*» euch a* in parting, and the basic 
function for this ia TRAN8L. Uhen TftANSL is called a node ie Mt tf , and if 
TRANS, succeeds, the node is added to tha tree. Juet ae with PARSE, a cat I to 
TRANSL nay opacify the naae of a clause or gfttieK^^* call ; aaSpro«aieat^a 
le*te»t -vat* lite htM* or *U. Unl Hw WWBE, th. cal I to TRANSL also containa 
the naae of »node it> Ins —antic i o prgeiiilal ton for which the generation Ie 
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to be done. Alio unlito theparee tr*e r th* daughters of a node on the 
generation tree do not havs tope accuaultted if* ordsr ,,.■ It is pottlbJt to 
•pacify that a nooVbo attacJiad an^Mfaara >n. the Ha^of d f Ai | ht ate-;el the 
currently active node. A call to TRANS- with a claueegpr. grgj§ r iMeie caueee a 
eeJ* to tha cwro^>oiKJlo^EnQlli^ow>er»*lng aoeclaJlat, and Its aucoaM or 
failure deteretnea the iuccaM or f »i Imto of TnAMSL . H TRAaSL It call ad to 
oenerata a eingle word, Jt e*ec#tet • procedure eaaaciettd "**" *he concept 
earner of the 0S§» RSS, or.PSB fo,be trane^ted. Theae ipd4vidup I procedures 
will be dltcutted pore fully OsIoh. 

The generating rouilnee Invoked by TRAtsJL : .|o gentr ata groupt look ,ve*n 
auch liM thalr interprttivt counterpart*. Th* baalp tt att at n t type It atlll 
the PpfRAtflAR branch function m \\ although v t UjM^l^to«fL.||B^aa>^aa«si^ 
alnce it. It not fil«lf J»W the |bjpBe|$aa£braBajkaftayJ^ 
generation. Tha ini«wtlon i funotloi^Jl|a 1 C%| fad |#£ ers analogous to 
thota infA^. ap aj-eF^ FQ-f, end rt I ated function* that add J alpraj* I on 
to a nedf, At tbt outtet, It la ae£ coapJatalg clear that It it necettary to 
conetruet a generation taae^ i pu*.beJd,i^ the 

location of en adjective or object of • proposition allow tha graa|y|r & ,t§ ■**>• 
decisions bated op tha a|r§aj|^, of.. Jja^eae^^ A 

tree alto peraitp aaay incorporation of -baakjua, should tb* generation pr oca t* 
run into difficulty. 

6*2.3 The. Ganera ting. Process 

Uhere the intarpraler ,poved lft$,%9.r£0t.wt§ir. i * .aam^a||Op,, ;! tha s gefa)rator 
poyaa basically top-down through tht •eaantlc repraaaAtatlorv That I », the 
generator epveft top?K*oun through tha UNKAl^ 
after participantt hove been trylattd. U tta t i nf a t through tha BPCHEU5RS 



links. An SKanpla ntghi b« th« M«4ft*tu«y to ttxplaln th« transitions 
involved* Figure M «***«• ths ••«*** J c representation fort '■ 

0sr Csphelopod besi tzt gel be Chr«sstophor«fi. The eepbSlopod Hat y*f low 

ttem*to&iK$£*'- *•■'-■■■ 
At the top Isvei the generator ctflltt {«8«L ttAWE mj6R TOPLEVa SttNOOEt 
RSS1I which in turn c*U« the syntactic OfMCrloH^^t @M&^ $»nci these ' -E' 
tags ar« no doubt distracting, I will Ignore thee f fee hers on. Any routine 
awtltined in tht* chapter .■ \t p#t ; of the^tnylleh/edii?^^ olftorif M 

noted. The sain relation (here* RSS1) :-*$fc always the topeost t1nk In the 
semantic representation, so It i* specified in ths clause oaf I. The first 
decision in CLAU3E ie whether to generate an interrogative, faperatfva, or a 
d*Mar*a-t*m eta*iaa^ register of these in relation ie 

checked for its receaaehdatiom Since here the clause* type ie "steteaeht," 
the generator folfow* the declarative path, ;, >#liel^li|itag^OEaJllnViTlWl ; n the 
current node C. The tarT8r\JE¥6t indicate* ^het '/the : hldS r oefSriied* uilil "hot 
heva a parent on the tree. Next:, GLAU8S esiis the phrase routine V6RB-WRA8E 
and which sake* ths cell (TRANBL VEJ» ftvfr StlOE, W&jr Note that In 
English, as in German, us uill follow Hudson end speak 3 ef a ^erb lihraes 
instead of a soty group. The generator trans! it st the aain verb first, eince 
it is the key to the ordering of ths other const itusnti in Ins cYiuss. A cal I 
of TRANSL with MVB will rsturn ths verb nods and also returns * participants 
list with ths grammatical structures that the verb expects for its 
participants. Ths list has been ordered, taking into account ths objects 
required by the verb chossn, Mis thsma of t»i lerttett^i, arid whether the verb 
1 a grammatically pastrive or active; V^eWWtttlt should also go on to generate 
any auxiliary verb* end »up«rvlse tens**, but only the steplsst active and 
passive construction* are handled right now. e 
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HEAD 



HEM) 
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Htmirn 
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0SS1 
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.i t . 1. ni l 



MiWMP 
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P9S3 
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4UKJnBBQfV 
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t 'l fraw nnjll w i f i 



P9S4 

THAM-ONE 
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Figura 6.1 
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The participants I \ st 'returned by the sain verb is In the fore of a 
series of calls, aha" the c I subs routine heed only execute it. CLAUSE does so 
now and eel 1st 

(TRANSL NG rtli «DI1 8EF@Rf* «*•* ^ed#> SNNOOE* tJSSl) 
The &EF&RE nero Ine^eafcee that this nedhlgwiup, since It will be the eubject, 
should precede thl verb Rode just generated, the feature MOM Indicates 
noeinatlvej other English eyntactlc cases used ere GBJeciJve and FOSSessive. 
FULL, as In chapter 2, esens that a rartkehlftsd noun group say be generated If 
necessary. In irineUtlngiiie regroup, tha^eiAt*iilBl ftr*t consults the 
COREF- regieier ''-ee^ler'^rtfartnN if any. Thf§^f i^eejfWa a ward choice 
hi II often depend- en the word choice Made f or Ate corefferent. Froe here NG 

caiis <transl nmmmmt ossiM it i* dear *m#*im*m nnw to 

tranelate the noun in* noun group iflrst, since Wb*hW«e-4aU tend to 
influence word eho4«a in the eodlifiars. Th1»4»4eat obvloufly true for 
coapounde, tfUcfc bay absorb sons, off the eoajiftafj*' For any eort pf noun, 
since we do net *eve a- ©no-tp-ono; eapping be tuo o n ii or a V end eoncopte, the 
generator aey IteWiP fe a* toes ehilf fling to *f mm/mm** betue** noun and 
nodi f iere. It eeeee aee* nature* fo eMpress these requireeants In teree of 
the noun to ba*e#Wied. At any r#te, ©nee-the^ou^ hae been tranalated, NG 
goes on to trarfilelio eodl fieri into the appropriate ipre- and poet-no* ina I 
structures* uafnt r»*a* based on seaan'fte cast to do the ordering. Our 
example ■'Aer Cephalopod* has only a dettrelner, which ie generated froa 
relations with eeee SBtKTiON and fUMBER, taking tha category glvan-neu into 
consideration. Since no relations remain, TRAMSL can collect Ite reeulte, 
"the cephalopod,* irt the noun group node under rHWSE. Both the eubject and 
the verb heve been genereted, so TRANSL can call the routine AGREE. AGREE 
makes morphological changes to ths verb root eo that It will agree with the 



/ 



;. ^ p&*il*piinf°W# i priffi^^ '«"&* ***•**" ' "**" "•'" **" 
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subject in per»on arrt ruiabsr. htott that In Enollah, ay M H Tt t 1» aucjh lass 
coaplex than in Geraan. ...fort that AGREE has • auch aaeier tlae than Us 
interpretive c^tai^t JJe$f£, •Ince It doe* not have 4p ^P^Pfr^'nQ *«*" •" 

unknown root. .. ..■!■■.■-;,,■ , ; ^ :r '.,: ; v ,---'^>^ 

CLAUSE is nou ready to generats .ihj| ,|»|<t ; |»nti^ g ln Jf|» r ?w^,j¥>i|«^A 
•hopping list, and It Jurns out .that -_!if.. : lffa^ to t^^|^j|i^t c |^|J^ci$. ,,^J|pt •»«««pl« 
finishes up at CLAUSE aekas. a second .ca^yeJG. ^l^flip^^fUIJL.Q^i.. 
SmOOE: 0SS2) . NG m M I oenereia th^ ; ag|n. .fHfMrat D3S£, Jhen take i?ar*, of 
the d«t«r»in«r, and finally call <TJ||fgi, JfMI |f£fP^ fVfl ffljjffl- f|**PA- 
RSS4) to generate ttM^a^Jawtlvaj. j^jb^ ^,||fha^| .-|^. 4U?fle^|*J^^ •?& «^M ,f»^ 
rationed, the generator gathers \^^£W l & %&tfof2^Jl$? y &<& tha 
constituents, wakes a PHRASE entry in the clauee node, decidej^ojji punctuation, 
and returns f roe CLAUSE. c ..,„ .. 

This was obv i ous I y § re I at I ve I y a i ep|| sxaop I tjggfne^gai g«|*jN|t I ng 
graamar can handle wore coepls* citst. B«fcr* 'l^tn#1Wa^ualft»ri^, , however, it 
■ ight be a good idea to take a look at tns aschantsa for wortl choice. 

6,3 Translation at the Lexical Level 
6.3.1 The English Definition Routines 

When TRANSL Is* called to construe* a terairial n$de In the generation 
tree, it looks up the concept ■frker asfoclstsd uith ths seaantlc nope under 
consideration and then retrieves an Engl |s^,df|lgi| ion r©A»tVn»4Pffl*"» 
concept. The dsf initio rout ir^t ars rciughl.yaDalogQut to the eeaantic 
specialist routines ih the interpreter. HkereTtis^teaisVtffc special iate 
Inspect the parse tree In order to build a eaaantic'raaraea^tatlen* the 
English definition routines start froa ^ 1 wt^%laj^r9Mi^l§0j^fPd uee 
special Inforaatlon to not only supply a nods for thi V.traa, land 
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hence a translation! but al^ao to »p«clf» oth«r iibctea that uit I b« required If 
this trans I at Ian la to be uasd. The, syntact I c shopping f i at aupp 1 1 ed by 
definition routines when a relation it translated is analogous to tha global 
varlablss bound bu ths intsrprstiva sMMntic spacialiata to supply ergueonts 

-■to, r^hitibns,,J'ft^^ :; i»aaait»e raara*a*itaije]h«>s- * ■ 

There Br^ : Mfcue-.*tajfderd 'dlctl-Oiiani- i? «MI«***a»» flWiSli»sl^Ut for translating 
semantic relatN^tjand TIMNSt-QBJ^PROP, for translating objects and 
properties. Tha two routine* art quits •Jailer, except that aora inforaation 
must be supplied to translate relations. 1 1 would probably be helpful at thta 
point to list ths Inf creation that can appear In s dsflnltlon. Starred 1 too* 
appear only In definitions for relations, unffi"fJia real lay "be ussd for all 
three classes. 

ORDER* LEXAGI or LEXPASS 

TYPE* Nttst* CMC, TUO, ate* This spscifiss ths sntriss in ths 
relation's linkage register that say be left understood. 

ROOT tha root of thjt trj|»sl*IM ^ > 

FEATURES! ths part of speech of ths translation and then a list of 
other features, which are not new required to eaten those In the TRANSL 

UORDj specified for irregular fores only 

PARTlPICIPANTSi* a variable noes or tee 1 1 procedure the t specifies the 
participants Met 

...'..'.,,.. DJNWTATIOliJ* currently only one feature and optional 

PROBABILITY! nuaber froe 1 to 18 

CONDITION: a procedure that suit evaluate to non-nil If this 

definition is to bs ussd. 

COLLOCATIONS i s list of parts of spssch and root, of words required, 
Thfrs is currently used only for 'pPapoamalfa/'M 7 ^^^^* 

PROfiHJUREt a procedure that is sxscutad If this dsfihition is selected. 
It allows a concept to be translated by aors than one word and does other 
useful thTng*r ' .--r :.■ . m m ^v^ . 



j**., y^v,,, . j , ;. __ •»_ ■ gi | igHj ^i^ g a wi nj ! y 3 g K ^ fi s,-^ „ uwi j ffl i ^roW B^^t^ii-ify^-^ ' .'iff "V* «.""*5"*"V-«*« 
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A cal I to a definition routine wpW rp^^^^Jpf^jt^T, %^f\,9* m**** lm 
dot I n i t tone, sech e>$rssssd I n teres gf Jjajg, , P!*P"|!P £V| &*& *■ • "■»?> , Th * 
possible def Initlone first go ^jOMg^m^J^^if^^^lf?!.^,' #4 ,l !^i ,i ?*\J^^ d 
on the part qf speech ri«iri<t }%JH JWpt^W* Wl^^^^W 81, 
a natch on coonotatlonf Uf 83NNqT^TJip f| i|j|y^ it 

Must aat;e,h th« connotstlons In, Jhs, ; |g*Ban||B, J**%| JHfl.'&l* J^ty'SMl* 1 ^ ■$? 
COITION ^ocsdurs^f thors is *|8r>« 4" ^^V^ 1 M9Jl9N f^^5J^li^ M 
are a I bo screened for CH^t ,^@9*^R -fljt *!©!>^SU iW^x^Q^^flS^. PPpfs ^QMPit. .. 
•te. ) 4 agreement, which . uora. diacueaed^fn aac|lej| v ,*.j| ? ^ A^Fa,^ ^fft 1 )*^"«*W 
scrssning, a definition with tha highaat ^p^y^„»f^^^. WF* # 
ar« several definition* with this probability, the first em encountarad ia 
used. In the English dictionary, the probabilities era e*ejreeeeJ & e^;f*uBjbere 
froa 1 to 10, and reflect a rough «tt lists eV%$ ^tii uorde 

should be tried. Probebi I itist cwil4 o^^oU^,^ «3 ra*a£d as seal I 

proceduree that check context and rsturn, ao aspFOpriatfi S^fNM"*. but no 
refinements havs been eade in this dlraettdn^ tfHSfc 5 been selected, 

its feature, are added to the feature. ^iijf.^gfl? : $$*£ > definition 
lists that passed the preliminary screening are prlllil^ the LEXALTUST 
register (I ok lea I alternativf lii|t)^on ths % .oode^ ctse b^gu^^^cee^earu 
later on. If we are Arans let ing i ? pjfti|||. lt«JfA|||i^^l *Jj*|M!lt «ipt ia 
also attached to the node. s ^'qii 1 !^ ? t ^dEW^e^, t ^eB , ^l^48IP^2.^---»W.V*^ — 
to TRA^, uh|%finle^^^ fl'Ma^P^'m^'^^ 

add the 4i nejcessja^y,a^fng < s ni for ii ^| l||[^id|||Lj^?. , ,,. nt /i!>?i ,;,,., 

6.3.2 q«aJ^Mr, S^^a^Hor^s.^ ,. -,.;.-■*- »-• :.w -.k^-.. 

The last section presentsd the siaptest ease, Hhere a concept jjaa^jeajg. Je 
translated by a eingis word. The definition rout I nee Bust eleo be able to 
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handle the case in which ona eaffcsr. toecoOe* several- word*,' and the cava where 
sever*! Interrileted earners becoee one uord. For situation 'one, several 
Mecharilees a¥e- dvdttableV If tfia' aaHUr 1« a relation* »w COLLOCATIONS list 
is used for pVapeafctlons and partictes reauirad do vepoa* adjective*, etc. 
Tfce definition procedures a cedia tftle Mat, and fneart trie reaped word' In 
the correct p(a<» tn the particloeote shopping Nit. f# ;i acrie coaplicated 
constructions, PROCEDURE 1* used. A* sub*, l fe# tna < aejhd *f 'dwaajila, that the 
conceptual emictut** contained tits edrfcef JPJGFRfliB, and ue wanted to 
translate the" ihaileh e» 'nafc* friendd^iifiK^'lny'iaift.^de- thlo In the eyctea 
is with a definition of the forei 



#BEFRIENQ 

(TnVWSL-flp. ( «£KAGT NO*; s«Kt 
PSIIWESt <vBW tflB) 




1«W l«JP!LM.thU, 



GnWt ((Tgga. NOUN flUP OtJUP 

^ i . _ ;W|U friendM , 

SffJOOEi <seeent ic nodi of ''tne r aecbnd arp u as n t of 

mm\m>) 

Here, LfXACT aearii- that the order of the Sroueents of the verb corresponds to 
the order of the srgueentB of Hie csondepf aarfceW ''loiet a**ne that no eerker 
arguaents have been left understood, arid ejfei ie tils root of the adin verb. 
Ohm** *pHMm I • produced by • dafWitidh, tn§ WdrH lit the WIST position 
corresponds to the part of speech TfWHSL would be hdiiinf fori In this case, 
the oaf I would have been (TRANSL VBtt MVS). Here, PMIT1CIPAMT8 ie eet to a 
variable whose value in the sustee is a sertel of edtllr. fne flMfftClWllTS 
would bet 



,-K «Vn -&' 1 s *-'*¥^^j£||^»**^«pK**^S-S^y*^*T'«*- 
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(SETQ PREPOBJ-INTJWISA M (TFWGL J£ . . . , 

Hon 

,.- BEFORE: f first verb>) ..; 

(TRANSL PREPG ',.-•-■ 

GROUP* 

ROOT* WEQUIRED-ffiEPt) .;■■,.._ ,.,.., 
(NG SIMPLE OBJ)))))) ,Wv ""' "' 

ARGSLIST is «n ordered lift of argueente of the RSS to bo tranaleted. The 
definition proceduro will eitupi regleter^o WlcW Ml to^jo eubetltuted 
for tRaxm^-W»% i*On t%>opo*itfori°f. gi^^ilJ: i^o^wt here and In 
PROGEDUfiE, u. .« . ^p«t«l »ort of Ci'll to "mWSL. Uh^ ROftt 1. specified in 
a TRAN9L dalt afohg with apart 'of^ajMadtrtrte offlnltldrt rouftnle^are by- 
paaaad. and the node uets tNe^*T#&, n#T, and, 1f gTVen, 'tne ^^eupp I lea- 
rn the ceFI. SlalHrly, W^i group & "oftee'le ! W>W^ah^am. ; W^he 
keyword GROUP can be used, followed by a Met of TRAH8L caUe and other 
functions. The GROUP feature aaHal Trpppbla to by-paee syntactic 
special ists. Note that we do not have y opacify the uorde In a group 

when using these features, for §K§epie Hhe pr epos It ion group fro* tfce 
participants list above* 

CTRANSL PREPG GROUP* . 

( (TRANSL PREP ROOT; »€q Ulf€0-PRERX 
UORD* *REOUlND-Pn#X) 
(TRAN9L NG StittE ORJTT 

SnNOOE: <seeentib nods of 000004 ergueent pf i 
0BEFRtfND»T ~ ■* " ■—■*-'_ 

This call says, "Translate a preposition group*, looking up the required 

preposition and using the second arguj WEND to feroduca a noun 

group." This ability to by-pats definition routines and syntactic specialists 

is extreaely useful, and allows us t<Twri to efficient definitione in a 

relatively ecohoaieat : aahner. The one : 'o%iliacf*liijnif'be 
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future thai Must be epecjfied along with fa* tarda, but it would bo 
rel*t1va4y easy to ifX»r|»or»t« * flnoaikting |«cm^.l« aid In definition 
wri ting, and to package the aes* coeeon tup*« e# F*0S*tiUHb in the eeae way 



that the participant* I iete he** hie« frirtiaprt ?f* : or fifteen etandard 



im> 



pattern* moo Id probab^ ha»dl* a a** »»r n*»tfeoa jrf |h* dif foront eultJ-Mord 
situation* that coa* up. «=-: 



6.3.3 Several rter**r«, Qn« Uord 

. The caw .liaavfc *ioou»*«d involved w <tonqw>t aorHoPvOoino to ••veral 
word*. Lot uiwi ta» a look at c^te tM. ^*ro aaxpr^t int*rre4atod concopt 

w»T<P*» JF- ..*»_ • ***!§kt **■£*,. . Thi« i* oojia uaing «£0^iJ)atioTt of th* ■. 
CB "°UW If* WWPWK **■*»«) GOWiUflN ^ uaad to^cify a placo of 

•MmtJe .rioirooaMtA^ the 



S"*"*, 



tWW#*j''. 



■ ' " 1 ' 1 1 1 ' 1 . nirii i 

RSS 

mmumamw 



MM 



PSS 



Aw' 



■?1B 



ftil 






l i*. H hu H 



"Tft! 



VtCmT**? 



eentonee, and then PfflQEDURE 



i» u»*d to toll tha^^rpot o^. ; |ha.ojinor|ltor how 



,W& * ~ **w^*r- 



-t^S'S^^V^*"^ 
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much of the semantic representation has been a b so rb e d by thp jMprticular word. 
An exaeple of a eulti-eerker tp single, word translation It the^generetlon of 
the adjective visible frpa the represe*|a|J,jDn rt^ajjii^fjiy^a ?«2.:c : l^^«;*h« 
etar indicates that ar>y aeaantic rode aey be in thLs j^^}o*p. : 0Bw *V&!? 
information, the translation routirre would contains 
*SEE 

- visible ■: 

CONDI TIONj 

(HATCH tTTtt-tmCi IPERSON) ^ 

SH-E 

fhUdlflERt #ABLE) 
(LINKS: IPERSON JSEE) ) 
PROCEDURE: 

(REMOVE #ABLE 0GENERAL MODIFIERS) 

Uhen CONDITION is executed, |h« ( roqtlna..|^^atf!r.ta^|^^.lt^ai in the 

seaantic representation currently being considered |»u ^|||is^|^, In ttila 

case the RSS for #SEE. The HATCH rout i|^ aoYaj ,tbr|>uj*j ; -Mm, WJFW ** 

LINKAGE paths specified, coopering concept eerier^ jpajnat, the jj^ojuajetit* 

given. If a concept Marker alone i.s not ew?uoJj_.^.dis^t^no^||h j| Mpd|fiec> the 

case nag be supplied, as it dona for the ^VETfRJ|fPT|. ; ^a^J|an jfboye* •• . 

LINKS, MODIFIER and FIND-LINK always rs^sr tp thst ^gjaa^|#, of the ra)l«tlan 

examined just previously. To changs the fooiis Ht pJ 7 a^aji^op. HATCH aay be 

given a semantic node, as wae dons above. HATCH la currently N pi ted in the 

kinds of coaparisons that can be aede, for sxaMple lij* npt i^ Doaaible to 

specify that two uses of the saae concept, .if$ujr sjhp|jy|d^ refer ^tq. different 

variables. Such extensions, however, Mould be straightforward. The Matching 

process is not a particularly sxpensivs one, since the ordering of the 

representation is always fixed, and. we knp^ ; which jpe^e are to be heads and 

which Modifiers. Note that HATCH is not the only sjajgt ojf routine th»t May be 

used as a cond i t i on, CONDI TIDN succeeds or «|U * ^VftijUPSbJP ^*har 1 1 a 
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associated proceoune succeeds of fell s^ 

Omm the Aeeajtfttta* -has bsen * ; se*rs*la^and., tW definition Is chosen, 
PROGCBURB it executed to ta>| I the gensratisr net to try tei translate the 
nodi f lor #Aat£ (where the case here it at van ae JKENBMU* since this has 
alrsaoty bnn -used* The only point war** iwwt4sn»f>*eo*ut PTOCSJURE is that 
its effects ewst be lis U so to the nods cmpt^^ by the 

TMtBk.cej.-th TMs i« necesMru to.~kMtft bejeM>vele*i*« *M»jeVwhen anode ie 
popped froo the tree, ho want all to*. 3 has placed on the 

generat i on- process* to disappear, with it* . The I ioAtaitons on PROCEDURE **• 
current ty s«l f- ie p oo od Jay the dictionary ue.4 tor, sines. Jos syeloe naked no 
checks cw* what is being seti 

Ths two deflnH Ion estho<»» diecutMd hers glvs the diet ionarg routines a 
grod* deal of power. The Method described In the I set section ai lous the 
generator to traosJstS a single earker into else* If isr plus noun, or to a 
whots clause if it is dsslrsdf. the eeihde In this section si lout ths 
generator to handle relation participant nouns such as indication, which are 
del i nsd as #bOritWWG plus a relation* in fh^^case ^eopp^lny. that', 
indicates. * W* can contains ths two eethods to tists a piece of sonant ic 

representation into acre than one Engl ish word, which should be useful for 
idiosa tic phrases I iHe> "Strike while the iron Ts hot." Givsn this gsnsrallty, 
the writing of dictionary routines i s rsiatTvsty siopls, and it will becone 
easier when eore etenderd patterns end procedures are built into the 
generator. 

6.4 Stuebllng Blockt In ths Generating Process 
In the sweep I s ussd to explain ths generating preoraa, the generator wae 
suspiciousfy suocsssfuf st each translating stteept. This is not always the 
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case, and Mvrfl problsa situations wi il be di scu s s e d hero* The Ur»t le the 
need for backup, then the issue of repetition, : m&.-j^ gni&^ -mf? ; mm*%m*m 
length. 

6,4.1 Backup 

There are several cases In which backup, is iieeaeeery. The aoet obvioue 
i s the caee of lexical papa, m ■■^m&m&mm* If* Mp !lttm^ ^w^am* J" 
general , if the first trans I stlon stteepted l^ the ouetoe does not succeed, 
the generator needs a way to ><tri»4ha alternative* In en orderly Banner. The 
Mechanise supplied to hand »• backup In ths gsn«ee*er is a fairly vnmrmi one, 
although J *eve i^t* written onough bacliia* eede wfriW&mln&Jtm give «om then 
en initial report on Us perforeence. The reeder aho*4d keep *n elnd that I 
,.aa discussing a facility provided !--ta al4ou, the ^e tlgn o r to ,Hpp^pone^»-Hte|Bkup 
into progress, not a full -fledged backup oschan iso. ,.,■■- 

There are two I ovale In the generator uheim choices or* toasts* a* the 
lexical I aval and within ths syntactic •pecteWsta, s ^tm*%immJit/irpH^-iim 
sets of I lets that era Maintained -as ,r#gia,tere .m. node* for ^aoM» purposes: 
LEXALTLIST (the lexical si tarns tju« I let) and ALTL1ST ^f^#*ruoturai 
alternative Uet). JUhen a definition ift#vatua|taAtas^ 
possible definition lists thai passed thslni tie! scrsenlng doecrlbej| l« 
section B.3.1 ere placed in the 1£>WITLIS| t ,ef, the^ node belno created. At the 
structural lavel f k avery chojcp, that eight- jpe^ ii fr pa n fl s n ^ 
uould not iepig autoeatic rovereai of choice* sees earl lerJ. leevee, tf<ac*# on 
the AtTLlST. Theee tracks consist of Jhe <: functl«n, in which the choice , 
occurred and a etateaant. label .where cedeJpr, tysi £Moj<na$ljs* f ,cf^ca^^ 

If the generator runs Into dlftlcukty^,^ at 

a given P©*"* failf . special backup routine ere c#J ledjg^Jnb general t ttp. 
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first step taJ^ 4*M;M» to try to retranslate the relation for uhtch thS 
probleestic twwentic node is * participant, If such a relation exists. 
LEXALTLIST's are used to supply these neu alternatives. Backup routines do 
what they can to keep other participants that ha%» been genercted intact, that 
is, to find lexical alternatives Hi th a4»i lar part I clpante shopping lists. If 
no lexical changsa solve the preb tee, ♦ i i*iHJc*i*iN' : ettpiBP'i s attempted. For 
thie, ACTUS? i» used, ami en attempt $• eade to nsveres =%he poet recent 
choice f irsi. The offend ma nodes ere popped ! #li!' : thf ' tree, and centre t 
returns to Ww routine end l«Mt specMisdiby 4UUST. T*e eynteettb 
ape*rMtMsts are structured In such e way the* business mJ 1 1 proceed as usual 
after a backup, except thet the ALTUST hee been reduced by one podPibillty. 

- For- the 4 Imited wi i b e r of ceeet s ito ap t ouy thfrs backup sch oas seees to be 
#sner*l--ew«|ejpt. ; .,:,4:t.#e*#-reet. on #**,aeesJi|*t«b ! ^ chbtcds *fcan be 

fully order with respect to each other, and that this ordering u HI not Vary 
frail ona -actuation to another. Only eore experience with the generator ui 1 1 
determine whether this eesuept Ion s Homo enough generality In the syetea. 

Before going on, I Hou(d I ike to consider the u*e of backup In the 
context of ganerp^Nxi. J t pay be that f\Jr^he> --¥l&i^\'*\if- poM fa'ymip to 
cut down on beckup by deferring wee decision* fb "TFalpr iW the process or 
anticipating others. 1 do think, however, ftttf beckup uiil refrain '■*%■ 
ieipor*aht facility In i generator. Unlike Interpret Ion, the pope lb I e 
outface structures for a given eeaentic representation are hot fixed. There 
can-be nc question here of generatVng "al¥" the pessWfmse instead of 
backing up, since with a faci litg for paraphrase, ue could no doubt go dn 
generating peecibHrties to ra^x^ rldieWbuf e^treiiNr. ft afght be desirable 
to generets, eay, the three east I iksty fextcii fossib^flflds it any given 
point. ThNi probably would not cause coebinatoHc explosion, since possible 
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paths Mould bt eliainated qglckly. I djjnoj havs enoiigh, paja^a^f .*lH»i-th» 
generator to judge the ^ojlit^l ltu^of' i .thJL«) t |^|0^ % ^ T aut t ^^aJtf;9P >m«||^^pH 

Mechanises are introduced, a backup ie©be*i»a will r^deu| t s r.ffa[ T n an,, 
important component In a J§k|„ generator, „. , 

6.4.2 Repetitions . _ .. ^^ _ s _ 4j # ^g, ,«...,, 

Tha phenomenon of rapatl1tlonJt.|a^afjplla^ •tHU.?^*rflM«,lr^t 
is. waning is not affected IFJIo^l^^.^Bflll^ if^^hoji^r^ bp £ a^^a««ly 
distracting. For jxaaple, thf^ Irs J essence of 4»^«ffl»«f ^f^fp»! ^ t •' n • 
tha phrase "el n d^tllch .^chtbare* ^c^V J^ ^^M»?e%«W* ^ # 
contains jtUibJ^ ,ynmm*^PUf&> bf^JfcJt^^HlU • 

"a dearly visible l^cjl I an.£^it, 149^)3 & fff*!.ifHQ9 tf«-*IIW*W "*■ 
chosen ajOgjoJt for lWU|M|| f , S^iBPj^^tha^tha^^f d^tj^y.anly , 
contained one translation for c foutUgh and that one was exLdJQi* : , : #n ...... 

"evidently evident indication" .founds^ |A|*!frJ!^»J4i*R^f5# ^iMv*". •« 
the AOJG routine had better r§tran f l|tj^^^ HWMkw 

eoaething a little less repetitive, . \ ,... , n a t , , 

Not all repetitions era *^^ in a 

sentence would not be noticed s^sii, ^IJfJIp^apggg;* wojt|d ^Sba, 
overlooked. In addition, soBj^r^Jtjen^of.cJ^^^slrj^j^re is deairatoJe. 
e.g. where para) IcMse J» «•«' §• I >f*tfiJfM c ^^* ? ,./^^M» ^ff^««p*» ln 
f unct i on tend to Ujf ! uence judeaonts ,-9*t£t ( .i$|£ 1 a SS&fMSi "*.* 

(D He went to the *op Jo talk Jo |ba n e|ephap|, 1it t Vj 

.(ID The problem withaeoting with tha Board : 

is |hat Harry won't coae. , :aa ,.,■; ^ .. f ... ^,,, ! , 

Here, the two uiib's in Ml) ssea aore proainant te^ftja^.tyfe^^t; In U). 
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This 999*9 to tie because the second la heed* ah adjunct clause rather than a 
prepoB it Ion group, ?he eobstion of uhtoh rabetltlona era repetitious, then, 
is not as straightforward »• It would eeetu £ 

In considering repetitions, let 'Ss ?s W«t oaks the distinction between 
structural and lexical ease. No efforts ddr* esds in the ousted to handle 
structural prod I so* like repetitious c I sues types, rflpviltieus tise of 
conjoined structures, *od other rs rati vs I y high Isvsi ph s no ao n s. Heurietice 
to hand Is these sorts of repetitions MOuld be KsciWu ' for a coaplsts sue tee. 
but 1 aave this Idu priority hsre. ,; This self si on was based on the aeeu«ption 
thst snough of the var 1st ion In the or ioihSl tskt »<ould cose through, eo that 
ssntences in ths-ewtpttt would not'-b^-odrt^'cep^^-d^^aeft tfther structurally, 
no a systee beeaae sore refined , however? V* wbutd need good heurttttct, since 
tho dl fferaht spoof allots are falssotftottsrd par tTcutar syntactic tgpss - the 
onea ' that they f#£ ff*s* ' - ' arisf 1 t would b* ssoi *f I Ihtb^ifr^lturi'r '' ' 

repetitions. ' '- ' 

For iSKlcal rspstrt ions, another useful distinction It bstussn function 
words and the rstt. Fortction uordt »>• l^saesltlons, bindsrg, snd '\' 
conjunctions. Since thou fore a clossd set, Jl^JWsi'l '~bi preferable to handle 
the* bg anticipation, rathsr than backup. Let ui lock's* ths systsa's 
binavldr for'ppepldl'rlona^ only check of this sort current Ig 

iapleaented. tfnena prapesftlen I* to 6e oeosrstsa by ths systse, ths 
preposition group spsbfdltst first le^sbsC* througn what has been generated, 
stopping at a sentence boundary or ths verb phrass of ths esjor clsuss, 
whichever -codes f irs^; ;: '"ff"e%hdr'"preps»»llo^ ; iP^^ s list of the 

prepositions is node, Hith thow that wili bs niarsst to our nw preposition 
group appearing first. Ths call to TRAWL "fl'lhsn sods, using S tag that we 
have not yet discusssdj e -'■'-* 
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(TR^jSU ,Pip» NOTi (prop! prop?*, • > , ^W^WWR* 1 * c^"?^ * •• . 

Ths NOTa dirtctf tl^ Jlcttl^K^r^^^i fjgu to W |d tbOSS uo/?o\chotc»s. 

It tnore is, no p|her cMcs . ; ^AI«ft4f». B |^ •4tUtit«lol»<«& WWt!»«W' 

by* ttw.^^iii^.^^nfi^Mifi^^^.-iiai^iffe t§iwmtotei*w*mitem fr °" 

thoond of thf Ht$^ ^rfi^ip^^t^fD^diipllffilliiy f^-ffp^MillfctlflBi^a^rffimj^cflPflr 

thf noH on* U? the ssotsnc*. ThjJOI |||^^i|^p|n|4y t ^i^|^^^ly 

bo used to ovoid structure! repot it lone tike thoee rtofgr^d ^ tho loot 

Jpr cpntonl.^ofd^ lot ut ■**• «* f ( l,nol a>tU*sM»V W*m «W«*» * f 

qoroforont noNQjgwgn ood «ti^ mot tff « j$& mm?$m$m '&*&&**** 

about rojjftlti,ow| ooong noun groups. bu| A|tdof)i|M«Nl>ctfM«P9 •***nI*UM'* 
. U i.nvolvsavh^s.^.1^^ 
coroforwtiol noun ^oi«t, J.tcfR «i|hti? ttgp<u*»m^^ift* **«» ■*•**«» th 

Ita as dotor«Jno^^ : J4.,«ffl«^i|Wy| timd faJ^ijm/h ^^t^Wm** ■ - 
pronoun. That is, ths systs* can refer book to "o big grey octopus" OS 

either: __ (>*o<'>^» w * n ? '* t»0 i? *•' <"' 

(i) . the i>i a or sy octopus ^..»-^ « -s .i;gjw< vv^ "r 

(U) thiS. OCtOpUS -.;■.-, ;..;;■-.' ^0 f j SSMj? W ''] i^ W« V" •. ' '>'* 

fto..chocKs i ^o.i;u^^o^t^,o^do scout »}&e^ so J«e^ r ^ th* trajeaj»M*#|HtHFif 
novo sloi 1st doscc!|itiojf R A »opbint4osts^ s»» %fj a r upuld^e^upabjy be ablb 
to us 1 oh est s<uM4J ! > tt, 3 ob^lop j, about :*i \M^tmm &, , tm)*Klm tMmmJrcm 
corotonont noun oroups, p^aU»UV*M«jOt«b»*«ll^ Ifk^tstwfc,, and itior 
ropotltlvonofMU ., Any .heuristics for teajHJ^ao. i soos^ jsy i f o p s nt oou* *«**>•. 
therefore^ Mf$Jd . hajiLjg. £% f iij^p^teg^c^ps njuja, s *o^ sojs r lal i#* very 
carefully. ^.i.?.4^r-w ■:■>-..?■* -*<* ?*«* -v^n ^ .-i bm-M£ 
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Al I the waet^ef the content words I J*« our "evidently evident" ekeaple 
are bandied uetag bbehep. the hstrtetfe currently used It that if the root of 
•n objective, noun, ort*** it rbpaaitd anuuhwe uitfein en ar puaant of the 
•bin verb or * eodtf *«r of the *ain cteuea. then en btteept should be hade to 
wr*et the eituetion. -Uni* imot»ot a pg s Tlainaf y cheek of IBdl-tLIST end 
then en attenpt «t» rep tec* the nee* *r «*»» effected, i* thie does **>t uork, 
the fopeti-tion H <*****& <&htm te« 'Mt'M'^JMiM^i 'pheeibie to initiete 
•or* erfteneive toeokup. 

The heuristics used here er« obviously very slapi* end Melted ones, end 
a flood deal of retJneusnt uiit %e nsuM s »y . 1*ebbblv *» eabkeot way to do 
tWe would be 4b'i#*lM^*bbO*Mtt^ heuristic* as 

po«e»*tb. then ran 4t» «u«t«i en soee tort. Thi* mm Vd give the designer a 
chance io »ocet« ttw ropetHlon. tket a^ the heert 

■of the pr^kw har. eeeM to 0)« to b» the is***** *; aftd once b^ 
are *oond. tomUr Hiptoaentstlon ebeuid fce -etfe leh ttor iwd . , ' 

6.4.3 These Long C ar e en Sentences 

Sentence length is a probiea that soebs trivial until one starts to think 
•bout it. Bound up in the question 'how long thou <<1 a sentence be?" ere 
issues of the sentence's rot* •• a carrier of wesseges. To chengs sentence 
tehgth we hews to hneu which Inforeatlon cen f» eefely sspereted out and which 
ib becerittbl to the integrity of the sesaeee. AM Waived here are 
questions about mmm nebory eapaettie*. Any nothwi of content tel complexity 
eust take t#to account hue** iH(rt.tsre eeeory and i»rw»«slng I ieNatlons. 
Soaethiny that (• couptex to ens bert of suets* eight bs handled m J th ease in 
• el f#*t iy dWf erent one; Thus, whin be 'try 1 ! to determine what i 1 10 a sentence 
ohould be. ue have left surface feneration and ere res My talking ebout deep 
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generatiom the process of organizing eessegss. 

My systee, of course, is based on the assueption that surface generation 
in tha target language will be sufficient. I have aeeuaed that in general He 
do not need to coepletelg re-build the Message for the target language, but 

,..,«,■;;"... ^.^"....^^."fr,., ?':« fi.,;i "'X. ": b$ m^ T Wv::'i < flirt* 'f?G- ft-'' .tX<«"*- "£e;S5 

rather can generate frop the coMperstively high level of the.seaantic 
representation. In Most cases, this ssbms to be the Meet efficient, way to go* 
and, since ue Hant to foilou the organization of the eeurce text as closely ae 
possible, it seees to be the Best effective as well. In dealing with long 
eentences, I have continued to avoid true deep generation, and I think only 

- • -. .-■■<■ •■■•- ■ * •'•v'n: ?•■'*■■;•: -.:;■:' .;:«■>-'. .■jy.*".-? sr.f ^ ; ;JTj«"1 ?k*T ,str>:p ■„.•:■ -"-O ?*n: : " t;'"- 

further research can deters ins whether this is a virtue, or a vice. Let Me 

.;■:■■ ■,;..-, :■■■'? . ;-.■ ^■l*i." ■:■' -"^"*' :p ■'**•■. ; ;v'o .3? ■^JudTnffKW !»« yw 

discuss long sentences in eors dstail, and then I Mill cope, back to this issue 

- ... ■ ■■-■ "?.■*:■:>:'. :-;•-,,; --.jv.. -^v- \l~^:->* "t:> «M , WHS < F?T J <"-•« ^- ! '" -^J'ii-i- 

below. 

First, I should say that although ecedeelc Geraan has a certain notoriety 
for its long sentences, the sentence length prop lea Is not unique to the 
languages chosen here. That Is, if ue were translating French to English, 
instead of Gerean to English, ssntence length would still be an issue, 
Certain things are always easisr to oxpress in one ; l anguege than another i it 
■ay take a clause to sxprsss in one uhat a word can express in the other. , 
Because of this, it is possibls to start f roe i an uncoaol ice ted eejnjtnce, _ ijn any 
source language end end up with either a very, large. and coaptex translation or 
e succession of vsry short ssntsnees that should be coajbtned. 

, ;.. ; S;i ; » ,. .-■■<;*: ■■■;;: ■ :i . :,'HA ■-i^'-f i"« ;l slabs' 5 ! 5|}W6 .S.irjS* ■•-■;.-..,■ • -:£:.ftf 

In the systsa, I have concentre ted on dealing with eentences thet are too 
long. This was strictly the result of ties Heitetion, since shprt sentences 
can aa just ae atyHeticwily offeihlfve, ahd^fhei^f^i^t^^ 
ae tong ^>«ss. (A I though possibly sho^t sehtsncsi #r> nbt fe^har.d on; 
coapreheneibility.) I should elso add that the heuristics in the eye tee mrm 

-very fr sj ws n tery -and wsrs added eors to Ifftrooucs ^"^ilu^^ah^'tdl^tteapt 

■■• ,-.■>■ . . •■.-■ ■. -V- ■-•-■• ■-, ■' •,■:■■■ v.... •<*"..- "*; ■*''--- v 7i?u .!x;;ld ,r;.^»*«a . ,yo» ; 3i r 
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any tort of cowprc hs n a ivo tnswsr to ths probisa. 

Ths first question that suet be aeked Is "Hon long It too long?", which 
should be ieeediatelu reforsulttsd t« "Mou coepleM is too coepTsM?". It is a 
coeeon observation that 1 tertt I vs structures ere eeeler for people to 
understand than, recursive ones. Ludwlg ftainers (3D warns against ths dangers 
of "Klesekonttruktlonen,' or tttccessiv* oabedding of cleueee. So length alone 
is not ths prebie* here, a I though often, of courts, excessively coaplex 
sentence* are also e«csssivsly long, length is one factor In caapletcity, but 
not the ©niy one. The rseursivs •tructurs* asntlenad. In f set, SMsapl 1 fy one 
very laportent contributor to complexity. Uhsrwver a nueber of things ars 
started but not tin) shed, us can expect the ceapJettity level to. bs high. 
Similar, but not identical, Is ths coop lenity introduced by back references. 
If us have, say, three back references in ths stes sentence, us ars not only 
fighting potential asbJguitiet, but also ths overhead of slaply maintaining 
the links be tusen core ferent iteas. The coeplewity Introduced by en 
unfaai Her word or by coep I letted sseantle content euet also be reckoned uith. 

Sentential caapis«iiy seeas to be an additive phenoasnon. Sons eentencee 

that I consider *b» complicated in English sees to be difficult not because 

they contain a dingle coapi letted structure, but rather as a reeul t of a 

compounding ot complexity* To tea what I seen by this, conetder our prize 

exampJe sentence and a translation that pareilele Ihe eeritential organization 

of the Garment 

(i) Ein doutlich sichtbarss Zsichsn for die tm Nsrvensystsm verlaufenden 
Eregungen Let gUie Spiel der Chr e ei t eah srtn oer Cepha» tp saen t Jener unter 
dor «eut lie gend e n as lb, breun, »chuorz, violet* Oder kerelnrot gefarbten 
Zetlan^ die «ic^ eptvte^ xueaeeeo«ie^^ee^ dureh madife #iiaa*e*nden 
tluekclh f licbenWeft autgebreitet warden klnnan. T 

HI A clearly visible indication of the excitations that run through the 
nervous tutta* i • the play,of ths ^oaateph o rea »f the oephalepcd, those 
yellow, brown, black, purple or carmine colored cell • thet lie under the 
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skin, and which aither contract or can ba apraad out under tha akin 
aurfaca by radially affixed eusclea. 

Thia tranatation would eound better ff ttf«e)Nf«i eemitfeee ^wat* ueed, but 
what about the fo Hon Iho? 

(ii* Ein intereeiantaa Berepiet da*ih«*et«dea^f#* oW Oa-aawtopheran. 
jener untar dar Heut Hagandan oelb, breun, whwari, violatt odor 

k^\TT 9 ^mmmm^m %\^mmm^ m^mmim m v warn ***** 

radiftr aneetzenden Muekeln flachanhaft a u saeb r eltet war dan kftmem 

(ii) An intaraating exaapla of thia I a, tha plfU «>f tha chroaatophoree. 
thoee yal Idtt. brbW b*3f p*pra •^ttrMM -Wafil*cWf»*«MH far 
undar tha sKln, and which ajthar contract or can bo apraad out undar tha 
akin aurfaci byValftffS aWMaieitei.^ ^ ! ^ ~' '"-"^ ' 

To «a, tha EnyUeh in (lit la fine era tingle aanrBirk^^^drtharaiir^rr^ 

not overly dieturbed by a eentence like* - ;»^-*- .. ?*-*!■■. v 

tliil A clearly vlalble Wileitfoh of 1^e»fcl^lerte**hi< rbh-fh^bugh 

Q^J^JSMNM atWBBWlMTiJaSSaff 

Therefore, it seeae, to ee that no one structure (say, an eabedded one) can ba 
cbnaidarad coaplox except with reejpt to _m con^ a ^i*^ ? clauee can 
tip tha bajanea in an Ptherulep .accap^ab^^aaii^^^|Kit^Ma cm noy>in the 
blaaa on tha clauaa alone: it io tha whole aantanca tat least) that 
contributes to Judjeaanti about ^.poaa^ityv ., ,, 

It is not always easy, then^ t^ dfclda which »»ntencea ..ahould ^ be broken 
down into seal ler ones. In this raepect, | think Jf°»«« hava an aaa^ar tiaa 
of it than aacMnee. ainca they haya f tja^4ir^qy||^oo;of datar||ln»ny 
complexity: if tha eind bogglss, ^J^p^cceiilex. ^oa^^a, whoaa ainda 
ara not atructurad in tha eaae way, ; #on;,t ^bo^lfJ^tne saas way. But avan If 
wo lack a nodal of hu«an iworg and procjee|ng, I ^ ^Jnk|t.|ajoaalbla to 
dorive a aeaaura for sentential coop laxity,. ..By jmatyj,in| eentencee judged by 
people to ba too coeplsx, it ahould,ba poaa]bla,to feee H up withji Jorau^a - 
not nacaasarily a siabla ona^JhpuBh -to pradict which aantancaa ara 
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acceptable and which are rtot. 

A look at the^epactf to caees handled In the aufefi.WM.kl probably be 
helpful, but before the "what," I oust again consider the question "when." 
When should the trans te ting »u»U« as** these checks on sentence, or nore 
proper ly mmhi , length? There esea to tieseversl option*. First, there it 
the deep generation approach. If «• •tart ecr generation proeeee froe a set 
of eecertiene froe the dots bass of the Seductive coeponent, we can include 
I initattons on how large or deep a sesentJc representation should be, or whet 
patterne *4 J Inlwga ean eceur. «l#sT»dtt¥p ly, us can rotsin a surface 
generation echeae but do eoeplexity ens I y» J a on the seeentic representation. 
Us would be using the «a*e sort of inioraa* Ion (aJ though probably not 
identical progrew) as for the deep generation approach, put ue Mould be 
analyzing a structure after it has been built, net In the process Of building 
it. The third approach is for ths surface gemrstor to benitor iteetf. If 
the output has becoae too coaplex, then suitable action can be taken to split 
the Sonant ic repreeentstloh, possibly rsfertwlaUng Sboe part Of It in the 
process. 

I chose the third alternative for the systoh, prnttsrily because 
complexity of expression is IsrtguaQs-espenoent, tfhile the tseantic 
represented on is a gesture toward language independencs. in a stapls 
inspection of the settarittc representation, m« cannot *MPr ohathar a relation 
can be expressed by a uOrd, or whether • clause wTff be nscstsary. In the 
present tystsa, it is easy enough, of courts, to dsterains whether sing Is 
words do exist for a given concept. Th* point to, however, that us do hot 
know whether a particular word can be used in the translation until we take 
into account both the intsrrslMlohshlps r between parts of the eessege and the 
linguistic constraints on structuring s lh thslienteflie. 
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To br I ng the dl ecuaelon down to -M^m «ojip£a tfc J **$ l. ; M* Mf «£#•#»«• 
with the exaeple eanta/icf,$lve?j frirt,i«^^ M»ldM^^# ^PI^M^^fHI^ 
coep I exl ty , The hew- let i ct preaanted here fare •* •#F t N» fl* IWF» ,«M JfhQk-. 
repreeent the only cape current ^he^jfj^^^^fjf; e> 4 .<**$*£ the* 
auggee t * hoMover , the .direct I on in . j*>i ch eot ^he|l| Proceed, . Thf f Enj[U»J h 
expaple, age In, uaat. ; •■ .**>»* -- i "-.n*:. ■. 

eurface by radial ly aff}wd.eweclae^ , 
One factor here^ejei tojP^.ijI^i&Uy^^b^^P^teyM* 1 * 
Indication." Ue notice that JUgJMlftM»*ft £* , tlkt 1 ^|l%iWi!f H *'- • nd 

that tin oth#r , TJWf"? of : ^-&W)*SVWfc4t*PMM)y$* a im 

preaent p« a prtpo.lt Ion orpup ("pf ...t^jap^g^le^g^ Ja ca/^l j^p e^pple 

noa i na M zat|on (ev|« f^|^^ ^ ^H°l W^WWBlJ'fff 1 "*f4 l £"4« d " ,f 
a 1 1 tha arjpjeenta of ttja. r«lp|!.on appear ;,.iNj|l{£i t ly |n ,tya) noun £$i£f , T"* 

auhject of aentence O) Ja |hjrefp^a,aaturj|^lt J^jtftf fcilfw-P JM.V"«W noun 
group contain, a poetnoalnfl rer*»hlfUd , «pf ^lea^ja|||l fl||fm' oqa^ .«* ,,|t» . 
argument* of the rel at ion, up can cal |, 1 1 f ,"e^e§tu|iteo^ The au^Jept i n 
(l) ia a lap eup^ea^urated,, pup %o tha RSQ clauea ttpt eodlf la*.4U«jLUUPQ* 
("that run through the ner^pua euj$ep!,» ., ,Wf, JmP&PVP iMWflfiS ftt** 1 
noun group to be a keg cojjtri outer lp,jepje^t I §l„„eap^ 
auperaajur-ation alone la,r»ot:e*ipyjb, l *o...ca^ 

(ii) above). ■..-., ,->yi ?u '.&,.;■*-- *■' 

A further dlffjpultg with »entenc« i\) ,ie, |hPja|DeaUJ^f i: phxeeja that 
atarta with "thoae yelloy, ... and.carpioe-ceiof||l eeMa.'Jfpra* ..tup, 
poetnoeinfl RSQ claueee aodify . f*U* ~«g*49 j^MfhW^W***^* con '° ,n 
the two Uting ajjd. Furtharapra, th« tacpod pottiioainel claupa he^e ;|"j*$ch 



*..$$$g%&g^#**.; ^$*,h. 



168 

either contract or cen bs apr sod out ...") is itself conjoined bg el thar / px.. 
This double I «vo I conjunction I ooK« I Iks »n }i*>ortent contributor to 
coaplexfty, but M la difficult to t*ti :m** :: 4o~'&&">ih* tin*/ If our eecond 
clause were "and which either expem! or contract," ths apposlt I vs Mould 
probablg not be too long. In (i), however, the too clauses ar« rtot parallel* 
(Tho first is active, and the eecond Is passive.) ThJe hmi to pueh the 
appeal tiye over |pjj^, J^t us sjjy, ; *hon* Jlpt'f* eppQsmvs con bo 
considered coop Is* if it gotts beyond two level* of jHRailJpl conjunct ion, or 
beuond one lav* I of non-pars I lei conjunction. Qnos again, a eoMptex 
eppeei ttve atone should not trigger rejection of • son tones, since exoMple 
(if) sbovs do«i not tM« to b« too cosp+SK. ■"" 

By coMblning « supersaturated noun group with a coop I ex appoeitive as 
def ined above, we do get a Sentence that if too coeplox. In dealing with ouch 
a oontence, mo would expect s gsnsrator to Interrupt processing if It detects 
an output that satisfies both eoeptexttu conditions. In the case of exeeple 
(U, nodes would then be pbpped free the tree peek to thl begrhhlng of ihe 
appoeitive, elnee appoeitive* end conjunctions ers natural places to break up 
a sentence. Uhat Is left on the trse (In this ease, everything up to "the 
ehr oe etephfl res of the cephalopoda uoutd bscoas a e^e sentence. The 
apppiitive part of the seMtntfe rsprssentsHon would bsUstsched and would 
becojM ths sscond sentence, with generation ila?tlhf free the top. Note that 
given the slMllarity In word order bstueen £ngl ith Mjor and secondary 
douses, it eight be feasibJs to reaodel see* of the output that wae already 
generated before the interrupt, Insteid of discerdtng if altogether. In eoftc 
cesee, however, this patching Might require a efftafn aiouht of Ingenuity. 

Note that not all long sentences con be split as easily •• the exaapla 
here, and, for soot, we Might not be able to find i division that Maintains 
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tha integrity of tha aaatJigt. Still, I think thtn^PPU fllvf# «n idee of 
iom of the l«tu«i ijnvo^ved. In haiidl.ljng,^ten^k*e^ 

reaerK that this ••pbMl. on ining fp^^l^^^t^^H^? ^MT^Ic* 
■ay be * I egul dad. 1 1 . eay be tha^aae^it ta &&ff$ &t0$ *ft qRR^JlPKP 1 . • 
heav i 1 y . For trant I j|t I on, howovar , I woujii ,jtjg*|,4£ f^p n ht|f Jf %1 o^ t expro.eod 
In terne of the surface structure pi the tarjet JengueM, since • |0 JWW 
eoeant I c reprosentet I on has • I rasdy been aej^eo; l^&flff^Mgfygfl^* thm 
aourca language* - .„ ,» .. 

Evan If It were "tended to J^pdla^ |% f^^fSf *£ #W££*#M* 
etructuree, tha, oanera [tor discussed h*%£M4&J!pMP #fttyy° f^lhaWM**" 
on.. There^ are at 1 1 1 .o«* vtry important orocejee*^ #n»|*?i been .. 5 
considered, and I would I !a^_ ^ ^tJl^nt^Sini^r^^laV,!^ ^(9 #P^M9P- , ^Hathlno 
. dlacuaaad hara It curi-ant jj^ iaplwf|vtad, flrlaar) ly »as*waa, a ora^t daf I of 
additional apparatua would bo necessary. 

,6.5.1. Dealing with Aatolgulty , : .^ ?i: j. . ^ . ' 

^iouityjs not en issu. ipr^joaipetgav ia thjh#«*f W&m% 4M» 
for Jha intarpratar. but fnere era, weraf ouaa44enf , u*f *& 
aechanjaes fqr handy indifferent *efM*ef e*b4oMHfc < 'MiM tovdl l - w ssd larai 

would be quit* ioportant. .,r*v *rt? -.■ 'o^ ■■-■ >r ' 

Thf f i^ft faf^a i,a thaabiliiy to traneiete.aeWpwUy-fer-eeoiguHy, 
Mhich ; ,u|s, ; ,,aantioi^^|n,,,th|j intcpductlofi. [3 J^tlHf|. ^Ja^wo^evtaciteueeftiiA <■ .«> 
aochaniaa sine. I find aysslf doing H occ#»laoet4y,H*ea,hand traj?e*«t«»§* 
For poople, ouch an ebi H tu !• u»«d east |ap &mm iWf*KMffi«# **»Mclt 
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Information I ike understood agents, etc. f?or a translating systsM with a weak 
deductive cosponent, the ability to translate •*blguitu-foc-aMblaul tg light 
also be used to avoid choosing between different senses of a word. Especial ly 
in languages a. close historically a* GerMan end English, it is often possible 
to Hrtd a word in one language that is sebiguout in the teas uay In the other 
language. In general, i ayitee Mould ham to be quite sophisticated to 
translate aetoigulty-for-aetoigultu, but at the Is* I col level, ue could sake a 
start with a liner. an English interpretive dictionary or ulth an associative 
ability to I ink, concepts to definitions. (Currenty in the systee, ue can aek 
what concept Markers are associated ulth a Gtrssn word or what English words 
are associated h I th a concept Marker, but thsrd Is no way to find out eaeily 
what concept Markers ars associated with English words.) It would ba a 
relatively sistpto Matter to take two concept barkens produced for a Gernan 
word, look up a tot of English words eisbeiaief wi th one of thee, and sse if 
«^ English words in this sst could a red havs ths othsr concspt Marker ae a 
Meaning. 

To translate aebiguitu^for-aMbiguity beyond the lexical level, the 
generator would need a Model of the English interpretive process. The task 
here would be to analyze the uay the target language wis aablgubus by 
inspecting ths eeMSntic representsHcfr of *by accosting pre-packaged knowledge. 
The prs-psckaosd knowledge eight express such feet e as; "In a Gsresn 
hOMlna I Nation of a transitive verb, the genitive could be eltner the subject 
or the direct objsct of the verb, if no other participants are given." The 
second phess of this tssk would then be to f tnd an Ehgflen structure witha 
elaHar asbiguitg, which for this case happens to be an fnglleh noelnallzation 
with an ai. To usa a faMitlar eKOMple, 

das Schist sen der Jlger - the shooting of ths hunters 
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("the hunters ahoet" or "thf hujifere fr^sho^") > 

The node! of etructurel aaOiguittao ,«|g^t T ff ,cH^Wl«( v f|ito||||ci^LH ; ,fr«i • 

pareer ut«d hero. Uhersvsc It coeet froa, however f m uo^d jai^euch, 
knowledge about aebiguitiee to be ctoteched froe the parser, since we do not 
want to have to siaulats the perain| 4 pro^reve^ i^ m|i#a^| to d^t^iiino 
whether a structure is eabiguous. 

The ability to trans lata aeolaAiity-for-aaoJguity |ua| dlscuseed mom Id ba 
attractive in a ganarator, but not eaaantial. Tha second feature I will 
discuss hare, anticipating aablguities in the output text, ia aore crucial. I 
think, since it relates to the reliability of a traneUtlng «ustee. Uhp t I 
would like to consider is the •l^lof] •^•^^.•^{I^Jf^^t^lU 
produces a ayntactically or M^tlc.! Ig a^loucK,. Mntanca. J^ m % j^rm 
step one could take to avoid this problea would ba to feed eyeru tewt. output, 
back into an English interpreter, to saa if what wae produced contained 

,,, ■ i-'s'- ■■■;'' ; <,\ '[" ■ "t- v'i.. -'-•' : .'■'"': .vV^"lH^' -''■'• -^ r'">:»V^ : - ! v:;. k ^p *-. -it's? .a «*;.£.. ^* ^' : ' "'•'"* ' ""' ' '"'''" 

serious aablguitiet. This would correspond to a tranatator reading over and 
correcting a translation. It Mould not be an absolute assurance, since the 
proof-reader would preeuaebly share the seas dsductiy^ oaja base use,d by the 
rest of the bus tea, and know I edge liaitatlpna .fright caujejjt.. Joys las 
aabiguitiee that are present in the output text. 

Even if such a proof reading facility ware avsilabls - ami especially if 
one were not - a generator should also be able to anticipate soaa aapiguJ ties 
and avoid thea. In his thesis i^i. Hill catalogs f pur payses pf glpbpl 
syntactic aabiguity inEnglisht 

(1) Choosing between participle and gerund 

(2) Choosing between noun and verb in clause firet word. position 

(3) Choosing the correct transitivity for Jha varb ; 



m 

(4) Choosing ft att ach— n t point for « mod Hglng phr— 
It fe ciiear thdt. given the features aa*ad on tbs generation tree, checks for 
these sorts of awbJguttles uould be relatively straight for nerd. A generator 
alto Might be able to anticipate certain Meant ic confusions about the scope 
of a quantifier. f*Quantif ier" is uted here aa an English part of apeechi 
see Uinograd (33, p. 67). > Checks for these aorta o? oablguitles could be built 
into the syntactic specie* ists «f a generator, or the ease information could 
be oMbedded in a seat I routine that would eon iter the generation process end 
interrupt if one of these aublguous structure* Mere generated. 

6.5*2 Uhen AH E*«e Fells 

In the lec-feeentatlon aa it •tanas now, if alt the backup possibi I J ties 
ara exhausted for a particular sentence, the generator siepty faila. Thia 
uould obviously be uneeeirabls behavior in a working trenelation eyctea, era! 
us uouJd went a eustce to be able to eeke the beet out of a bad situation. 
There see* to be tuo direct iona one could go to eeet thie goal, one being 
eoapreeiee and the ether paraphrase. Beth featuree uould be deeireble in a 
suetee, but it it not at all clear hou cueh behavior oou Id bo produced. The 
two uilt be considered briefly hare, but no solutions wHI be offered, since 
they depend, I think, on extensive further reeeereh. 

In the current version of the generator, backup te hendled by trying 
alternative choices, but this Is aluaye ulthin tha context of a eet of fixed 
choicest he atteapt is eade to iuspend rules. In actual situations, it ie 
entirely possible that no combination of permissible choices adequately 
translates the original, and, in this case, tie uould uent to produce the beet 
approximation possible. This could be dons either by I saving out some Of the 
content of the original, or by violating one or aore of the rulea of the 
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target language. (The other .alternative. I. ^spbr.tf, ^ic* lp dl.cu.Md 
bslow.) In inril.^ * r -^ F ' ^1^ f°^ ^?i^ f f M^**t#f » M i » ^«?tlf^^ ?•„., 
viol.t. rulo. rather than to )ff^|pi|,^Qifi}, , The^heJ|e J|g<*?<&4N4t* «• 
suspend, however, can bt • difficult ons, sod in arriving at thJM,da$j^&ej|i, us 
sight gst involved in dslieats trade-offs between different possibl Mtlss. 
The generator would probeblg nssd JHw ability to produce a sst of alternative 
tree, for which different ™..^ of 

critsria to detsrains which „repr.e«|ed fhejget ca^ga^ J»\M tfti ^ to 
sake cosprosiss. would be isportant in a workinj orator, ; ,jnce \J oft«n 
sssss that ths ssssncs of translation is the abj I J tu to find good coaeroaisee. 
If no satisfactory coaproaiss can be found, ths next step Is to tru 

,= -". ■ ■? ■:.■■': i: r ; -« , ;<S--. *■?*!■*-.*■ ?: -;■' ',fi94« ; ' sM yaft* jl*tT 10b*»O -f;^ J - ! - ' *»i" H ' 

paraphrase. I a. using "P^aphrass* ^^.ai^e t^^^f ch^ga 
in the eKplicit eeaning, although not In the total eeaning of a text. Let a* 

which tot., aeaning I. squivaisnt. J*^^ 

in Gerean a. "sin hsllsr Tag- (ntsr^lu^a b^^da^. , Obv^.^ y^ 

.Kg is Car of clouds, «*n ths «f^l>l£^^ If 

ths sun is bright, m expect the ekj to t^cljpr. . .^^if^paSII^-P 

on diff.rsnt.nds of »fc 1*#r*^^lf^ 

statsaents differ., ths *^««t i^ ^. ^.ti H yt r^a^ A s^^a^l^t. , 

This particular •"»l»lg «-W IM ? if«|>Ml ^RPW» "iWnW^tlMPWn^^o 
definitions, by translating btXL as alajt under certain etreuaetaneeet we 
probably would not use a general paraphrase sechaniso. Not all possibls 
paraphrasss, of courss, can bs anticipated in this way. If ths generator 
cahnot translsts a phrass, ws will want it to consider the iapl (cations to eee 
if another equivalent phraee can be found. This will involve a rsturn to the 
deductive coaponont, sines onlg a fraction of the ieptictt aeaning of a 
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■sentence it carried Th tha taaantie raaroeantatien. the deductive coaponent 
uouftf prt iuaab tg f tnd aft oqulvattnt p i r^wm , »lttr tfa Mwntlc 
representation, and t«nd tbla new ropraaintat ion bank to the generator for 
another try. 



Although '.the' gaftifatw currently iipjoaented la very United in ecope* I 
novo triad to forooae the kind* of eittanefona that Mould be neeeaeary. Tha 
Important - f aahret of ; tha generator ara that it ueaa tha Meant I c 
repreojwitMfdn^^^ !• not conetraJned te generate eoftponente 

in tha linear order that they will appear In a aantanca. Ulth tha exteneione 
aad» to PROGfWtWi* mo can oaintetn a generation tree, and it ia poeeible to 
by-pa«« •yntaetic epeciefiet* or daf inltton reutinee, to opacify a daf tnition 
I lot to be ueed, or to axpHcltly ruJo out particu+ar uord choices. Eaphaala 
hd» bean plaeod on aeao problaai that ara traditionally considered atyl letie, 

• f ■ •-■•■* ^"' '. - ■■■■■■ -.-'>■'■ :■■'■; ■■■-■ "' ' "" '■ * 

but whi eh dra of cortatderablo iapertenae far translation. A great daal noro 
analysis need* to bo dona ■•"aw tha aaaantic eotivatlon for partlculer eyntactic 
choices* 'and th* generator would si »o benefit free investigation* of problme 
such aa repot itfveneee and eantanee length, finally, the iaauea dfaeuaaed in 
sect ion S.S - a*etdWg eaotguitu, suspending ruteaV ■ mi paraphraae - mrm 
prob toe* that or* w3o% open for 'i^thar' : reaaaWh/"'"'" ' 
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Chapter 7 — Conclusions 

In the course of describing the iBpleeentotion, 1 hivi discussed eote 
.problems end •oft solutions. Not all pf the orobi eta have been satisfactorily 
eolyed, and aaong those art soee, I tblok ? ,,thj|t art Inteftttitp enough to 
Justify tort intensive research. ( In thje final chapter, let to review Mhat I 
consider to be the atjor probleM encountered ,|n thip project and toko soae 
. reetrke about the different solutions propoesd. 

Hooding tht lift of probltM In eechenical translation Is stl 1 1, of 
course, tht problea of understanding. This was outside tht scope of tho 
project, but I want to eephssiza again horoj that tfut undoetlanding of tho 
•ourco tOKt io crucial to trustworthy trans I at ion. Related to this is tho 
issue of accountability. A user should alueys be oble to taji • tyste« uhat 
choices were Bade and why. Just as a huaan tronslatof oculd give reaeone for 
a particular di tee© i gust ion or word choice, ) think It Is essential that a 
systsa be able to do the sees. This will not guarentee reliability, hut it 
does give the user soae control by giving, hit a chance to catch gape in the 
knowledge base, incorrect assumptions, etc. 

The first problsa encountered in the lepJeaentatlon wee that of parsing 
Garten text. Here, there Mere two difficult erees - Geraan inflection and tht 
relatively wide (coopered to English) syntactic variety, I.e. prenotinel 
clauses, the relative freedot of uord order for verb objects, end-order 
constructions, etc. The former Involved changes to PROCRAftlAR to handle 
multiple feature I lets. These changes were exteneivt* although of a rout I nt 
nature. I f PRQGFWtttR had original ly been written |» hand It Russian or 
Icelandic, the Morphology of Gsraan wouid not havt cote tt such a shock. 
English is biased aJaoat exclusively toward uord order in the linguistic 
trade-off between aorphophoJogy and word order, so one would t*pect to need 
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fairly extensive changes in order to handle ■ More heevilg inflected language. 

Uith respect to actual parsing performance, I would sag that the general 
approach ueed bg ItTnograd is as good at hand I lng Geraan as it is st handling 
English. That is to sag, the per foreenci of the parser leaves no doubt that 
it could someday be extended and embedded in a practice rsyatem. There reaain 
certain trouble spots, however, that are present in English but are 
exacerbated in Geraan. The implementation uses a number of ploge to deal with 
the acre varied syntactic choices of Geraan, but it is not clear that the 
solution is general enough. The recent dissatisfaction expressed about backup 
in language parsing teeee to be wsl l-founded, and It Mill be interesting to 
see the results in this area. 

A keg question In Mechanical translation ie what the input to the 
generator should took like. In designing the implementation, I started uith 
two aeauapt tons about this issue. The first was that for a given target 
language ue can predict the sorts of information that uill be neceeearg for 
generation and the sorts that uill not. Second, I assuaed that eurface 
generation would be enough, that the generator could follow the ganeral 
organization of the source text sentence for eentehba. These are based, tn 

■■:■■■ ' -.v ■■■ 

turn, on the underlying hypothesis that translation of acientlflc prose does 
not need the fut I power of a general purpose generator. Adopting these 
assumptions resulted in a commitment to the use of a eemantic representation 
as input to the generator. In chapter 6, we sau that these assumptions do not 
a I uags hold. Sons situations require paraphrase* and in others ue Might have 
to reetrueture the message entirely. I think they are true often enough, 
houever, to justify substantial differences between the fori of generator a for 
tranelation and general purpose generators. I could be wrong in this, 
houever, and only increesed research will tell whether efflclencg lies in the 
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direction of a special purpose or of a general generating process. 

There are many aspects of the two languages considered here that I would 
like to have investigated in more detail. All of these. 1 think, are 
interesting research areas in their oun right, irrespective of the 
implementation involved. Only a token gesture was made toward ueing 
information about collocations for parsing and generation. It would be 
interesting to see how this information could be used to build up lexical 
expectations about the rest of the sentence, in addition to the syntactic 
expectations currently embedded in the parser. Furthermore, the area of 
generation poses a number of interesting questions, many of which have been 
given only rudimentary answers here. Issues of word choice and sentence 
length deserve more attention. A good deal more analysis neede to be done on 
questions of semantic motivation for surface structure choices. Finally, 
another very interesting problem is that of suspending rules to make good 
generating compromises. 

Throughout this project, I have been continually impressed by both the 
economy of natural language as a communication medium and the variety of its 
mechanisms. I find this convincing evidence that any translating system that 
throws away information, be it syntactic, lexical, or semantic, cannot hope 
for success. In the end, only a total approach to language will offer even an 
initial solution to the translation problem, and a lot of intriguing questions 
still remain unanswered. 
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APPENDIX A. WORD FEATURES 



WORD DEFINITIONS FOR THE INTERPRETIVE GRAMMAR REQUIRE THE FOLLOWING SYNTACTIC 
INFORMATION. 

STARRED FEATURES INDICATE REQUIREO MATCHES IN INFLECTION. FEATURES UITH OOTS 

INDICATE TYPES OF FEATURES: 

PERSON- P1ST P2ND-FAM P2N0-POL P3RD 

GENDER- flASC FEM NEUT 

CASE- NOM GEN OAT ACC 

NUMBER- SING PLUR 



ADJECTIVE, EITHER ONE THAT CAN BE DECLINED OR NOT, 

COMP -FORMS COMPARATIVE 

SUP - FORMS SUPERLATIVE 

IF A SUPERLATIVE OR COMPARATIVE IS ACTUALLY FOUND THEN THE FEATURES SUPERL AND 

COMPAR ARE AOOED. 

ATTRIBUTIVE ADJECTIVES: 
(ADJ ATTR DECL COMP SUP) 

COMPLEMENTS: 

(ADJ NODECL COMP SUP .CASE. ) 

NOOBJ 

ADJECTIVES THAT MODIFY VERBS OR OTHER ADJECTIVES: 
(ADJ RELMOO NODECL COMP SUP ) 

THOSE RELMOD ADJECTIVES THAT MAY NOT APPEAR IN THE FIRST POSITION IN THE 

SENTENCE: 

(ADJ RELMOD NON-FRONTAL) 

POSTNOMINAL ADJECTIVES: 
(ADJ POSTNOM NODECL) 

WAS FUR: 

(ADJ INTER NODECL .CASE.) 

WO, WARUM, WOHIN, ETC.: 
(ADJ INTER) 

WORUBER, WORAUS, WOZU, ETC.: 
(ADJ INTER WO-FORM) 

ADVERB MODIFYING ADJECTIVES AND OTHER ADVERBS: 
(ADV) 

BINDERS: 
(BINDER) 

COORDINATING CONJUNCTIONS: 
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(CONJ) 

DER DIE D AS ETC. i 

tOET DEP .SERBER.*' 1 .CASE. .NUMBER. ) 

DIESE JENEETC: 

(ENDINGS DETERMINE GENDER, CASE « NUMBER) 

COET OEHAOJ) 

EIN KEIN E1NIGE: 

(ENDINGS DETERMINE GENDER. CASE. * NUMBER) 
CDET INDEF SING) 
PLUR 

MELCHEj 

(DET INTER OECU 

UESSEN: 

(DET INTER NODECL P.GEN) 

MEIN DEIN ETC.i ^_^ '^ 

(ENDINGS DETERMINE GENDER. CASE * NUMBER. SEE POBB-SUB6T FDR P.) 
( DET POSS P.PERSON. P.GENBER. P.NUMBER. > 

JA. NEIN, OANKE, AHAi 
(INTERJECTION) 

STRONG NOUNS. THAT IS THOSE WAT HAVE A RERLAR DECLENSION; 
(NOUN STRONG .GENDER. GEN-ES NQPLUR ) 

GEN-S PLURf 

PtiUwHEN 

PLUR-E 

PLUR-ER 

PLUR-N 

PLUR'E 

PLUR"ER 

PUJR" 
IF THE NOUN MAY TAKE MORE THAN ONE GENITIVE OR PLUJWL ENDING, THESE ARE LISTED 
SIDE BY SIDE IN THE FEATURE LIST. RATHER THAN Mi 8PARATE ENTRIES. 

UEAK NOUNS, LIKE SOLOAT, NEN8CH, ETC.t 
(NOUN UEAK .GENDER.) 

MIXED NOUNSt 

(NOUN MIXED .GENDER.) 

NOTEi ALL NOUNS ALSO MAY BE EITHER COUNT, MASS OR PR#H (PROPER NOUN). SHOULD 
A PARTICULAR NOUN BELONG TO MORE THAN ONE OF THESE CATEGORIES THE FEA TURES 
WILL BE LISTED TOGETHER IN THE SAME HAY THAT THE TRANSITIVITY PROPERTIES ARE 

For the verb. 

UEAK AND MIXED NOUNS MAY INCLUDE GENITIVE ENDINGS IN IRREGULAR CASES. 
CARDINAL NUMBER! 
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(nuto 



DIE MEINIGE, DIE ME1N&, ETC. 

uM&T»ilAE^ is 

EVALUATEO. 

(POSS-SUBST P.GENOER. P.PERSON. P.NUMBER ) 



PREPOSITION: 
(PREP .CASE. PRE ) 
POST 

MAN JEMANDET&: 

(ENDINGS SUPPLY CASE ft NUMBER) 

(PRON ABSTRACT INBEF ) 

UER, WAS: 

(PRON INTER .CASE.) 

MEIN- ETC. 

(SEE POSS-SUBST FOR EXPLANATION OF P. ENDINGS OET0WNE GENDER. CASE. + 

NUMBER.) 

(PRON POSS P.GENDER. P.PERSON. P.NUMBER.) 

DA-COT1POUNOS: 
(PRON PREP RELMOO) 

ERSIEESETC.i 

CASE MAY BE EITHER (NOH GEN DAT ACC RBI UEHF) 

(PRON PERS OEF .PERSON. .GENDER. .CASE. .NUMBER. ) 

DAS. UAS, OAj 
(PRON PERS RELMOO) 

EIN. KEIN: 

(ENDINGS SUPPLY CASE ft NUMBER) 

(PRON PERS INDEF ) 

DER. DEREN, UELCHE, ETC.t 

(PRON REL .CASE. .GENDER. .NUMBER.) 

SELBST, ALL. ETC.: 
(QUANT MASS ) 
COUNT 

PARTICLE USEO AS SEPARABLE PREFIX: 
(SEPPR) 



VERB: 








(VERB REG UML 


SEIN 


.TRANSITIVITY. 


-SEPPR .SEPPR.) 


IRR 


HABEN 




♦SEPPR 


MIXED 






INSEPPR 
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IF -SEPPR, THEN THE SEPARABLE PREFIX IS GIVEN. FOR SEPARABLE PREFIXES. BOTH 
PARTS OF THE UORD RUST BE ENTERED SEPARATELY WJ€JICTIQHJIRY. ALTHOUGH 
SEMANTICS NEED ONLY BE JUS? ON THEXOatMO. Mf UfllBjO JHIffr tS LABELED 
UITH ♦SS'PR AND OBEST T NEED Ut MtVlM. WW BwBIFfW, ErfMER HI FEATUHE 
LIST OR 




TRANSITIVITIES AREi 

A+O R*A U+A A*C A*A R«0 R+G A*P A«E RtEZAQflGNUPEI 

I -INTRANSITIVE A-ACGUBATIVE O-OATIVE G^ENITIVE N-NOfltNATrVE R^EFLEXIVE 
U-(FOR UEnFALU DATIVE REFLEXIVE P-PREfOBITlON AS OBJECT WIANKSHIFTED NOW 
GROUP E-ANV ADVERBIAL. SOME VERBS HAVE OBLUWTORY LOCATION. JI«E. ETC. JUST 
UHICH CASE APPLIES IS SPECIFIED ROUGHLY IN SB RICY1QNS. 



IF A VERB HAS KJRE THAN ONE TRANSITIVITY RELATION. 
PARENTHESES M ONE DEFINITION. RATHER THAN RAKING 
THESE LISTS ARE THEN EXPANDED AUTOMATICALLY UHEN 



ARE INCLUDED IN 
LISTS FOR EACH 



«- *k 
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APPENDIX B. SAMPLE PARSE 

EIN DEUTLICH SICHTBARES ZEICHEN FOR DIE lit mmti^^ 'i&LHMm^ 
ERREGUNGEN 1ST DAS SPIEL QER CHROT1ATOFHDREN. 

(((EIN DEUTLICH SICHTBARES ZEICHEN mM£. W^fcWttS^^ 
ERREGUNGEN 1ST DAS SPIEL OER XMjmifflm T 
(CLAUSE MAJOR TOPLEVa DECLARATIVE REGULAR-ORDER) 

((EIN DEUTLICH S I CHTBARfS ZEICHEN FDR WE IB ipVEJ«(8IEI^V|RWWFENDiN 

ERREGUNGENT 

(NG NCfl FULL NOUN DET INDEF NEUT SING P3RD COUNT) 

(EIN (DET INDEF SING NEUT NOTO) 

((DEUTLICH SICHTBARES) (ADJG ATTR NEUT SING NIXED NGNEX) 

(DEUTLICH (ADJ RELMOO UNDEa C0HP SUP)) 

(SICHTBARES (ADJ ATTR OECL NIXED NEUT NOW SING CONP SUP))) 

(ZEICHEN (NOUN STRONG NEUT NOtl SING P3RD GEN-S PLUM COUNT)) 

((FOR DIE IN NERVENSYSTEN VERLAUFENOEN ERREGUNGEN) 
(PREPG SIMPLE) 

(FOR (PREP ACC PRE)) 

((DIE IN NERVENSYST01 VERLAUFENOEN ERREGUNGEN) 
(NG ACC SinPLE DET OEF NOUN PLUR MASC P3RD COUNT) 

(DIE (DET DEF ACC FEN PLUR)) 

((IN NERVENSYSTEN VERLAUFENOEN) 

(CLAUSE RSQ PRESP PRENON NONEX SUBORDINATE FEN PLUR ACC UEAK) 

((in NERVENSYSTEN) 
(PREPG NO-RSQ ADVERBIAL) 

(in (PREP niXED PRE)) 

((in NERVENSYSTEH) 

(NG DAT NO-RSQ DET DEF NOUN NEUT SING P3RD COUNT) 

(IN (DET DEF NEUT OAT SING)) 

(NERVENSYSTEH 

(NOUN STRONG NEUT DAT SING P3RD GEN-S PLUR-E COUNT)))) 

(VERLAUFENOEN 

(PART PRESP OECL UEAK FEH ACC PLUR ATTR IRR UNL SEIN NO-GE 
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MVB PLAIN P AS-VERBW 

(ERREGUNGEN (NOUN STRONG FEB ACC PLUR PJRQ FLUR*EN COUNT) ) J > ) 

(1ST (VERB IRR HVB PLAIN SEIN NO-END N PRES INDIC P3flD SING) ) 

((DAS SPIEL DER CHRDnWOPHQREN) 

(NG NOH FULL DET Wttf& N^JfiUM^O JCOUNT) 

(DAS (DEt DEF NOfl NEUT SlNGJI 

(SPIEL (NOUN STRONG NEUT NOM SINE P3RD GEN-S GEN-ES PUJR-E COUNT)) 

( (DER CHRGflATOPHORENl 

(NG GEN SIMPLE DET DEF NOUN PLUR tIASC P3RD COUNT) 

(DER (DET DEF GEN ttASC PLUR)) 

(CHROMATOPHOREN (NOUN PLUR-EN PLUR P3R0 GEN GEN-S J1ASG .COUNT) ) ) ) ) ) 
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APPENDIX C. A SECTION OF THE CONCEPT MARKER TREE 

EACH CONCEPT MARKER IS LINKED TO ITS PARENT BY THE IE PROPERTY AND TO ITS 
DAUGHTERS, IF ANY, BY QQtfJi. THE ORDERING OF RESTRICTION LISTS CORRESPONDS TO 
THE ORDER OF THE SEMANTIC ARGUMENTS OF THE RELATION. 

(DEFS ^MENTAL-PROCESS UP ^RELATION 
DOWN (^PERCEPTION ^REACTION ^COGNITION)) 

(DEFS ^PERCEPTION UP #MENTAL-PROCESS 
DOUN (0SENSQRY-INVOLUNTARY #SENSORY-YOLUNTARY) ) 

(DEFS 0SENSORY- INVOLUNTARY UP ^PERCEPTION 
DOUN (^DISTINGUISH ^PERCEIVE)) 

(DEFS /^DISTINGUISH UP #SENSORY- INVOLUNTARY 
RESTRICTIONS: (LIVING-THING ^CONCRETE ^CONCRETE)) 

(DEFS ^PERCEIVE UP ^SENSORY- INVOLUNTARY 
DOUN (#SEE) ) 

(DEFS #SEE UP ^PERCEIVE 
RESTRICTIONS: WANIMAL ^CONCRETE)) 

(DEFS 0SENSORY-VOLUNTARY (#PERCEPTION) 
DOUN (OBSERVE)) 

(DEFS ^OBSERVE UP #SENSORY-VOLUNTARY 
RESTRICTIONS: (0HUMAN #OBJECT)) 

(DEFS ^REACTION UP W1ENTAL-PROCESS 
DOUN (W4ISH-FOR)) 

(DEFS WJISH-FOR UP ^REACTION 
RESTRICTIONS: (#HUMAN (EITHER: ^RELATION /WBJECT))) 

(DEFS ^COGNITION UP MENTAL-PROCESS 
DOUN (0NEUTRAL-COGNITION #VALUE-ASSIGNED)) 

(DEFS ^NEUTRAL-COGNITION UP COGNITION 
DOUN (0KNOU) ) 

(DEFS 0KNOU UP 0NEUTRAL-COGNITION 
RESTRICTIONS: (0HUMAN tfFACT)) 

(DEFS #VALUE-ASSIGNEO UP ^COGNITION 
DOUN (0ASSUME)) 
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APPENDIX D. SAfflE SEMANTIC REPRESENTATION 



A saaple repraaantation of the tantenca, "El« aautlteh alchtbaraa Zeichan fGr 
dig in Narvanayataii v*r \mrt*nt*n Err*a»ipari itt da* ft#l*l dar Cl « uwlup l m en 
dar Caphalopoden.* Saaant ic : atraat«raa; pradaaad' far iaxwa ami ttteaa producad 
by datarafnara hava baan oalttad for clarity. 



RSS 



HEAD 



0EQUATE 
TTZ 



HEAD 



immLtmmmtmtm 



flETA 

^SOMETHING 



HEAD 



RSS 
#ABLE 



HEAD 



RSS 
#SEE 



I 



n 



OSS 
•PERSON 



RSS 
•HAVE-PROPERTY 



n 



iip winW 



nSS - 

«PUY 



Hi« 



OSS 
(TCHROflATOPHDRE 

miiiini^iii imttmmttmMmmimtik 



RSS 
JHAVE-AS-PART 






JT 



OSS 



JCEPHALOPOD 



PSS 
#0PTinun-C0GNITIVE 



MHM*«ri««WnMMWMMNMM 



SEE NEXT PME 

=1 
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APPENDIX D. (CONO.) 



SEE PREVIOUS PAGE 



RSS 



VINDICATE 



HEAD 



OSS 
^EXCITATION 



RSS 
#RUN-AROUND 



RSS 
^CONTAINMENT 



n 



OSS 
WCRVOUS-SYSTEM 
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